Get the job you really want.

Top Reliability Engineer Jobs in San Francisco, CA

4 Days Ago
San Francisco Bay Area, CA
Hybrid
1,900 Employees
160K-213K Annually
Senior level
1,900 Employees
160K-213K Annually
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
The Staff Site Reliability Engineer oversees performance aspects of production applications, developing frameworks for testing and maintaining capacity plans. They improve the SaaS service experience by optimizing code and automating issue remediation, while leading projects and mentoring engineers within a collaborative team environment.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+53 More
10 Days Ago
San Francisco Bay Area, CA
Remote
3,300 Employees
137K-218K Annually
Senior level
3,300 Employees
137K-218K Annually
Senior level
Information Technology • Security • Cybersecurity
The Network Reliability Engineer will enhance network resilience by engineering solutions for operations at Cloudflare's core data center network. Responsibilities include managing network hardware/software, automating operational tasks, and leading system design projects.
Top Benefits:
401-K
Child Care Benefits
Commuter Benefits
+45 More
2 Days Ago
San Francisco Bay Area, CA
Hybrid
1,050 Employees
Senior level
1,050 Employees
Senior level
Cloud • Software
The Senior Site Reliability Engineer will ensure reliability in global monitoring infrastructure, focusing on availability, performance, and growth. Responsibilities include collaborating with software teams, designing deployment models, automating processes, debugging issues, and participating in on-call rotations to enhance incident response capabilities.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
4 Days Ago
San Francisco Bay Area, CA
Remote
9,500 Employees
167K-269K Annually
Expert/Leader
9,500 Employees
167K-269K Annually
Expert/Leader
Cloud • Information Technology • Productivity • Security • Software
The Principal Site Reliability Engineer at Atlassian will enhance cloud service reliability, optimize operational efficiencies, and lead cross-functional initiatives while mentoring engineers. They will leverage deep expertise in cloud infrastructure and high-availability software management to advocate for reliability practices across teams.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+58 More
2 Days Ago
San Francisco Bay Area, CA
Remote
7,500 Employees
135K-215K Annually
Mid level
7,500 Employees
135K-215K Annually
Mid level
Cloud • Information Technology • Sales • Security • Cybersecurity
As a Senior Software Engineer at CrowdStrike, you'll develop and maintain reliable and scalable services, enhance monitoring systems, and improve architecture for a cloud-native security platform. You'll collaborate across teams and mentor other developers while promoting best practices, particularly with Go.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+44 More
3 Days Ago
San Francisco Bay Area, CA
Remote
9,500 Employees
Junior
9,500 Employees
Junior
Cloud • Information Technology • Productivity • Security • Software
As a Site Reliability Engineer at Atlassian, you'll focus on enhancing cloud service scalability, reliability, and performance. You'll collaborate with a team to manage caching infrastructure, and automation, while improving code and debugging applications in a high-availability environment.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+58 More
4 Days Ago
San Francisco Bay Area, CA
Hybrid
1,900 Employees
145K-193K Annually
Senior level
1,900 Employees
145K-193K Annually
Senior level
Cloud • Fintech • Information Technology • Machine Learning • Software
The Senior Site Reliability Engineer is responsible for ensuring the optimal performance and availability of BlackLine's services and infrastructure. This role involves capacity planning, responding to customer escalations, identifying performance issues, maintaining metric frameworks, and collaborating with development teams to improve application performance and security.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+53 More
4 Days Ago
San Francisco Bay Area, CA
Remote
3,000 Employees
126K-185K Annually
Senior level
3,000 Employees
126K-185K Annually
Senior level
Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI
As a Senior Site Reliability Engineer at Cisco Meraki, you will facilitate cloud adoption, build and maintain infrastructure as code modules, and collaborate on compliance and security practices, ensuring robust and compliant cloud services for the company.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+88 More

Featured Jobs

4 Days Ago
San Francisco Bay Area, CA
Remote
3,000 Employees
126K-185K Annually
Senior level
3,000 Employees
126K-185K Annually
Senior level
Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI
As a Senior Site Reliability Engineer, you will improve the developer experience for Cloud Engineering teams by designing and evolving infrastructure for cloud applications, resolving complex problems, influencing operational excellence, and collaborating across teams. You will lead incident responses and contribute to sustainable practices.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+88 More
6 Days Ago
San Francisco Bay Area, CA
1,200 Employees
191K-239K Annually
Senior level
1,200 Employees
191K-239K Annually
Senior level
Digital Media • eCommerce • Gaming • Mobile
As a Staff Site Reliability Engineer, you will enhance the reliability of Crunchyroll's data infrastructure, focusing on automation, monitoring, alerting, and working closely with data engineers to ensure efficient data services. Your efforts will directly impact the availability and performance of services for millions of fans worldwide.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+58 More
6 Days Ago
San Francisco Bay Area, CA
1,700 Employees
130K-280K Annually
Junior
1,700 Employees
130K-280K Annually
Junior
Cloud • Hardware • Security • Software
The Site Reliability Engineer will maintain and improve infrastructure automation, manage scaling, and monitor infrastructure to ensure reliability. Responsibilities also include defining the infrastructure roadmap and providing technical support for engineering teams.
Top Benefits:
401-K
Commuter Benefits
Company Equity
+45 More
6 Days Ago
San Francisco Bay Area, CA
Hybrid
1,050 Employees
Senior level
1,050 Employees
Senior level
Cloud • Software
The Principal Site Reliability Engineer will focus on the operational excellence of datastores, ensuring reliability, availability, and performance. Responsibilities include building and supporting mission-critical datastores and collaborating with engineering and product management teams to innovate solutions. The role demands a strong technical vision and expertise in automating scalable systems.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+52 More
7 Days Ago
San Francisco Bay Area, CA
2,500 Employees
189K-234K Annually
Senior level
2,500 Employees
189K-234K Annually
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3
As a Senior Software Engineer at Roblox, you'll focus on improving network reliability and efficiency by developing automation systems, managing network operations, and collaborating cross-functionally with infrastructure teams. You'll address technical challenges at scale and participate in an on-call rotation.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+46 More
7 Days Ago
San Francisco Bay Area, CA
Hybrid
2,500 Employees
234K-284K Annually
Senior level
2,500 Employees
234K-284K Annually
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3
As a Senior Site Reliability Engineer, you will build and support the infrastructure for Roblox's private cloud, focusing on orchestration systems, service discovery, and performance monitoring. You will automate processes, create fault-tolerant systems, and analyze system designs to ensure reliability and production readiness.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+46 More
7 Days Ago
San Francisco Bay Area, CA
2,500 Employees
316K-384K Annually
Senior level
2,500 Employees
316K-384K Annually
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3
The Engineering Manager/Senior Engineering Manager leads the Compute infrastructure team, enhancing system reliability and managing production health. Responsibilities include collaborating across functions, building robust infrastructure, and driving projects that improve scalability and performance. A successful candidate will have extensive experience in engineering management and a solid software engineering background.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+46 More
7 Days Ago
San Francisco Bay Area, CA
2,500 Employees
219K-284K Annually
Senior level
2,500 Employees
219K-284K Annually
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3
As a Senior Software Engineer on the Reliability team at Roblox, you will enhance engine stability and reliability by developing software solutions to monitor performance and crash metrics, mitigating incidents, and automating processes for improved reliability. You will work on various applications across platforms, collaborate with a team, and engage in a call rotation for incident management.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+46 More
7 Days Ago
San Francisco Bay Area, CA
450 Employees
180K-225K Annually
Senior level
450 Employees
180K-225K Annually
Senior level
Cloud • Greentech • Other • Energy
As a Senior Site Reliability Engineer at Crusoe, you will ensure the reliability and performance of infrastructure by detecting, analyzing, and preventing issues, while automating processes and collaborating with engineering teams. Your role includes monitoring system metrics, incident response, and driving continuous improvement based on customer needs.
Top Benefits:
401-K
401-K Matching
Commuter Benefits
+34 More
8 Days Ago
San Francisco Bay Area, CA
Remote
200 Employees
Senior level
200 Employees
Senior level
eCommerce • Software • Design • SEO
As a Senior Site Reliability Engineer at Webflow, you will enhance the reliability of customer-facing infrastructure, improve observability practices, optimize applications in Kubernetes, and work closely with various teams to ensure platform stability and security for millions of users.
Top Benefits:
401-K
Commuter Benefits
Company Equity
+36 More
9 Days Ago
San Francisco Bay Area, CA
Hybrid
2,500 Employees
284K-332K Annually
Senior level
2,500 Employees
284K-332K Annually
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3
As a Principal Software Engineer at Roblox, you will design automation and reliability systems for the global network infrastructure, lead projects, collaborate cross-functionally, and contribute to a strong engineering culture. Responsibilities include building cutting edge systems and participating in on-call rotations.
Top Benefits:
401-K
401-K Matching
Child Care Benefits
+46 More
10 Days Ago
San Francisco Bay Area, CA
Remote
3,000 Employees
173K-242K Annually
Senior level
3,000 Employees
173K-242K Annually
Senior level
Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI
The Lead Site Reliability Engineer will design, implement, and lead a highly available orchestration platform, influence architectural decisions focusing on security and performance, build documentation for automation and resiliency, and support cloud technologies. The role involves mentoring, leading projects, and participating in a 24x7 on-call rotation.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+88 More
10 Days Ago
San Francisco Bay Area, CA
Remote
9,500 Employees
Junior
9,500 Employees
Junior
Cloud • Information Technology • Productivity • Security • Software
The Site Reliability Engineer will work with the SRE team to manage and improve caching infrastructure and automation for Atlassian's Cloud products. Responsibilities include ensuring high availability and scalability of services, debugging and improving code, and automating routine tasks while collaborating with various teams.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+58 More
12 Days Ago
San Francisco Bay Area, CA
Remote
3,000 Employees
95K-153K Annually
Mid level
3,000 Employees
95K-153K Annually
Mid level
Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI
The Site Reliability Engineer will maintain the reliability, performance, and scalability of production systems, collaborating with various teams to ensure availability and compliance. Key responsibilities include implementing robust monitoring systems, participating in audits, ensuring industry best practices, and promoting automation processes.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+88 More
12 Days Ago
San Francisco Bay Area, CA
Remote
3,000 Employees
173K-242K Annually
Senior level
3,000 Employees
173K-242K Annually
Senior level
Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI
As a Lead Site Reliability Engineer, you will design, develop, and operate a secure cloud platform. Responsibilities include enabling cloud adoption for teams, managing costs, building automated reporting capabilities, maintaining infrastructure as code, and collaborating on cloud strategy and compliance.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+88 More
7 Days Ago
San Francisco Bay Area, CA
86 Employees
Senior level
86 Employees
Senior level
Artificial Intelligence • Robotics • Automation • Manufacturing
The Senior Reliability Engineer will design and execute test plans for humanoid robots, ensuring reliability and durability by analyzing failures and providing data-driven recommendations to design teams. Responsibilities include conducting accelerated life tests, collaborating with hardware engineers, documenting failures, and supporting failure analysis efforts.
20 Days Ago
San Francisco Bay Area, CA
Remote
1,500 Employees
160K-222K Annually
Senior level
1,500 Employees
160K-222K Annually
Senior level
Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software
As a Senior Site Reliability Engineer at Upstart, you will enhance the reliability and performance of our production systems. You'll implement monitoring standards, improve incident response practices, and automate operations to support a high-quality customer experience. This role requires collaboration with teams to enhance system effectiveness.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+56 More
All Filters
Date Posted
Job Category
Experience
Industry
Company Name
Company Size