Top Reliability Engineer Jobs in San Francisco, CA

24 Days Ago
San Francisco Bay Area, CA
Remote
51 Employees
Mid level
Cloud • Software
As a Site Reliability Engineer at RunPod, you will design, implement, and maintain scalable and highly available systems, troubleshoot complex issues, manage large-scale infrastructure, and automate processes to ensure reliability and performance.
24 Days Ago
San Francisco Bay Area, CA
Remote
51 Employees
Senior level
Cloud • Software
The SRE Manager will lead and mentor a team of Site Reliability Engineers, overseeing the design and maintenance of distributed systems, ensuring reliability and scalability, and managing infrastructure security. Responsibilities include strategic planning, establishing SLIs, SLOs, and SLAs, and collaborating with cross-functional teams to meet organizational goals.
4 Days Ago
San Francisco Bay Area, CA
Remote
300 Employees
122K-139K Annually
Mid level
Edtech • Fintech • Information Technology • Software • Financial Services
As a Site Reliability Engineer II, you will enhance site and system reliability through monitoring, automation, and infrastructure management. Your role includes managing IaC, ensuring performance metrics are met, and collaborating across teams while participating in on-call rotations to address incidents.
Top Benefits:
401-K Matching
Commuter Benefits
Continuing Education Stipend
+4 More
5 Days Ago
San Francisco Bay Area, CA
467 Employees
Senior level
Information Technology • Machine Learning • Software • Analytics
Invisible Technologies is seeking a Principal Software Engineer specializing in SRE/DevOps. This role involves leading technical initiatives, mentoring team members, and developing cloud-based architecture while ensuring security and networking considerations are met. Candidates should have strong experience with Kubernetes, cloud providers like AWS and GCP, and infrastructure as code tools such as Terraform and Ansible.
8 Days Ago
San Francisco Bay Area, CA
1,320 Employees
Senior level
Fintech
The Staff Software Site Reliability Engineer will lead incident management, oversee change and problem management processes, develop reliability engineering tools, and promote SRE best practices across teams, ensuring system reliability and stability at Credit Karma.
Top Benefits:
401-K Matching
Child Care Benefits
Commuter Benefits
+39 More
6 Days Ago
San Francisco Bay Area, CA
Remote
1,900 Employees
207K-289K Annually
Senior level
Information Technology • Mobile • News + Entertainment • Social Media
As a Staff Software Engineer on the Compute Reliability and Efficiency team, you will focus on Linux and Kubernetes systems engineering, enhancing the performance and reliability of Reddit's infrastructure and tools. You will write and design software for availability and efficiency, collaborate with engineers, and automate development processes.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+46 More
6 Days Ago
San Francisco Bay Area, CA
Remote
1,900 Employees
207K-289K Annually
Senior level
Information Technology • Mobile • News + Entertainment • Social Media
The Staff Software Engineer will focus on lower-level systems engineering, particularly in Linux and Kubernetes, to enhance the performance, reliability, and scalability of Reddit's infrastructure, while collaborating with other engineers to automate and improve critical processes.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+46 More
7 Days Ago
San Francisco Bay Area, CA
86 Employees
Senior level
Artificial Intelligence • Robotics • Automation • Manufacturing
The Senior Reliability Test Engineer will design and execute reliability test plans for humanoid robots, focusing on modules like actuators, batteries, and sensors. Responsibilities include developing specifications, conducting electrical diagnostics, and utilizing CAD for test fixture designs, with a strong requirement for Python scripting to automate testing processes.

Featured Jobs

7 Days Ago
San Francisco Bay Area, CA
160 Employees
180K-230K Annually
Senior level
Consumer Web
In this role, you will build and maintain scalable infrastructure ensuring reliability and low-latency experiences, collaborating with leadership and software developers to implement best practices in cloud technologies and infrastructure management.
7 Days Ago
San Francisco Bay Area, CA
237 Employees
160K-250K Annually
Mid level
Artificial Intelligence • Cloud • Software
As a Senior Site Reliability Engineer, you will automate processes, improve team workflows, manage deployment tools, and maintain secure infrastructure while working with a diverse technology stack in a hybrid cloud environment. You will be responsible for monitoring and improving system performance and ensuring the reliability of enterprise SaaS offerings.
8 Days Ago
San Francisco Bay Area, CA
Remote
1,500 Employees
Mid level
Cloud • Information Technology • Productivity • Software • Automation
As a Site Reliability Engineer at Boomi, you will develop systems and software to meet customer goals. Your responsibilities include maintaining infrastructure as code, ensuring system reliability, improving operational processes, and collaborating on product features while participating in on-call rotations and monitoring systems.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+24 More
8 Days Ago
San Francisco Bay Area, CA
Remote
1,500 Employees
Mid level
Cloud • Information Technology • Productivity • Software • Automation
As a Site Reliability Engineer at Boomi, you'll develop sophisticated systems and software, working collaboratively in an Agile team. You'll ensure the reliability of production systems and work on infrastructure management, observability, and automation processes, while actively participating in on-call rotations and improving operational workflows.
Top Benefits:
401-K
401-K Matching
Adoption Assistance
+24 More
18 Days Ago
San Francisco Bay Area, CA
69 Employees
191K-239K Annually
Expert/Leader
News + Entertainment
The Staff Site Reliability Engineer for the Data Engineering team will maintain and enhance the reliability of data infrastructure, collaborating closely with engineers to implement automation, monitoring, and best practices for data platform performance.
22 Days Ago
San Francisco Bay Area, CA
Remote
95 Employees
185K-225K Annually
Senior level
Software
As a Senior Platform Engineer at Mux, you will design and operate infrastructure for high traffic platforms, improve usability and automation for CI/CD systems, lead cross-functional projects, debug production issues, and establish engineering standards.
Top Benefits:
401-K
Adoption Assistance
Commuter Benefits
+42 More