Get the job you really want.

Top Reliability Engineer Jobs in San Francisco, CA

Blackline

Staff I Reliability Engineer - FedRAMP

4 Days Ago

San Francisco Bay Area, CA

Hybrid

1,900 Employees

160K-213K Annually

Senior level

Apply

1,900 Employees

160K-213K Annually

Senior level

Cloud • Fintech • Information Technology • Machine Learning • Software

The Staff Site Reliability Engineer oversees performance aspects of production applications, developing frameworks for testing and maintaining capacity plans. They improve the SaaS service experience by optimizing code and automating issue remediation, while leading projects and mentoring engineers within a collaborative team environment.

Top Benefits:

401-K

401-K Matching

Child Care Benefits

+53 More

CloudFlare

Network Reliability Engineer

10 Days Ago

San Francisco Bay Area, CA

Remote

3,300 Employees

137K-218K Annually

Senior level

Apply

3,300 Employees

137K-218K Annually

Senior level

Information Technology • Security • Cybersecurity

The Network Reliability Engineer will enhance network resilience by engineering solutions for operations at Cloudflare's core data center network. Responsibilities include managing network hardware/software, automating operational tasks, and leading system design projects.

Top Benefits:

401-K

Child Care Benefits

Commuter Benefits

+45 More

Cisco ThousandEyes

Senior Site Reliability Engineer I (Python/Golang), Agent Ops

2 Days Ago

San Francisco Bay Area, CA

Hybrid

1,050 Employees

Senior level

Easy Apply

1,050 Employees

Senior level

Cloud • Software

The Senior Site Reliability Engineer will ensure reliability in global monitoring infrastructure, focusing on availability, performance, and growth. Responsibilities include collaborating with software teams, designing deployment models, automating processes, debugging issues, and participating in on-call rotations to enhance incident response capabilities.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+52 More

Atlassian

Principal Site Reliability Engineer

4 Days Ago

San Francisco Bay Area, CA

Remote

9,500 Employees

167K-269K Annually

Expert/Leader

Apply

9,500 Employees

167K-269K Annually

Expert/Leader

Cloud • Information Technology • Productivity • Security • Software

The Principal Site Reliability Engineer at Atlassian will enhance cloud service reliability, optimize operational efficiencies, and lead cross-functional initiatives while mentoring engineers. They will leverage deep expertise in cloud infrastructure and high-availability software management to advocate for reliability practices across teams.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+58 More

CrowdStrike

Sr. Software Engineer - Cloud Platform Reliability (Remote)

2 Days Ago

San Francisco Bay Area, CA

Remote

7,500 Employees

135K-215K Annually

Mid level

Apply

7,500 Employees

135K-215K Annually

Mid level

Cloud • Information Technology • Sales • Security • Cybersecurity

As a Senior Software Engineer at CrowdStrike, you'll develop and maintain reliable and scalable services, enhance monitoring systems, and improve architecture for a cloud-native security platform. You'll collaborate across teams and mentor other developers while promoting best practices, particularly with Go.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+44 More

Atlassian

Site Reliability Engineer

3 Days Ago

San Francisco Bay Area, CA

Remote

9,500 Employees

Junior

Apply

9,500 Employees

Junior

Cloud • Information Technology • Productivity • Security • Software

As a Site Reliability Engineer at Atlassian, you'll focus on enhancing cloud service scalability, reliability, and performance. You'll collaborate with a team to manage caching infrastructure, and automation, while improving code and debugging applications in a high-availability environment.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+58 More

Blackline

Senior Site Reliability Engineer

4 Days Ago

San Francisco Bay Area, CA

Hybrid

1,900 Employees

145K-193K Annually

Senior level

Apply

1,900 Employees

145K-193K Annually

Senior level

Cloud • Fintech • Information Technology • Machine Learning • Software

The Senior Site Reliability Engineer is responsible for ensuring the optimal performance and availability of BlackLine's services and infrastructure. This role involves capacity planning, responding to customer escalations, identifying performance issues, maintaining metric frameworks, and collaborating with development teams to improve application performance and security.

Top Benefits:

401-K

401-K Matching

Child Care Benefits

+53 More

Cisco Meraki

Senior Site Reliability Engineer, Cloud Foundations - REMOTE

4 Days Ago

San Francisco Bay Area, CA

Remote

3,000 Employees

126K-185K Annually

Senior level

Easy Apply

3,000 Employees

126K-185K Annually

Senior level

Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI

As a Senior Site Reliability Engineer at Cisco Meraki, you will facilitate cloud adoption, build and maintain infrastructure as code modules, and collaborate on compliance and security practices, ensuring robust and compliant cloud services for the company.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+88 More

Featured Jobs

Blackline

Staff I Reliability Engineer - FedRAMP

Featured

Hybrid

San Francisco Bay Area, CA

Atlassian

Principal Site Reliability Engineer

Featured

Remote

San Francisco Bay Area, CA

Cisco Meraki

Senior Site Reliability Engineer, Cloud Foundations - REMOTE

Featured

Remote

San Francisco Bay Area, CA

Cisco Meraki

Senior Site Reliability Engineer, Engineering Enablement - REMOTE

4 Days Ago

San Francisco Bay Area, CA

Remote

3,000 Employees

126K-185K Annually

Senior level

Easy Apply

3,000 Employees

126K-185K Annually

Senior level

Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI

As a Senior Site Reliability Engineer, you will improve the developer experience for Cloud Engineering teams by designing and evolving infrastructure for cloud applications, resolving complex problems, influencing operational excellence, and collaborating across teams. You will lead incident responses and contribute to sustainable practices.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+88 More

Crunchyroll

Staff Site Reliability Engineer - Data Engineering, Platform

6 Days Ago

San Francisco Bay Area, CA

1,200 Employees

191K-239K Annually

Senior level

Apply

1,200 Employees

191K-239K Annually

Senior level

Digital Media • eCommerce • Gaming • Mobile

As a Staff Site Reliability Engineer, you will enhance the reliability of Crunchyroll's data infrastructure, focusing on automation, monitoring, alerting, and working closely with data engineers to ensure efficient data services. Your efforts will directly impact the availability and performance of services for millions of fans worldwide.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+58 More

Verkada Inc

Site Reliability Engineer - Infrastructure

6 Days Ago

San Francisco Bay Area, CA

1,700 Employees

130K-280K Annually

Junior

Apply

1,700 Employees

130K-280K Annually

Junior

Cloud • Hardware • Security • Software

The Site Reliability Engineer will maintain and improve infrastructure automation, manage scaling, and monitor infrastructure to ensure reliability. Responsibilities also include defining the infrastructure roadmap and providing technical support for engineering teams.

Top Benefits:

401-K

Commuter Benefits

Company Equity

+45 More

Cisco ThousandEyes

Principal Site Reliability Engineer, Datastores

6 Days Ago

San Francisco Bay Area, CA

Hybrid

1,050 Employees

Senior level

Easy Apply

1,050 Employees

Senior level

Cloud • Software

The Principal Site Reliability Engineer will focus on the operational excellence of datastores, ensuring reliability, availability, and performance. Responsibilities include building and supporting mission-critical datastores and collaborating with engineering and product management teams to innovate solutions. The role demands a strong technical vision and expertise in automating scalable systems.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+52 More

Roblox

Senior Software Engineer-Network Reliability Systems

7 Days Ago

San Francisco Bay Area, CA

2,500 Employees

189K-234K Annually

Senior level

Apply

2,500 Employees

189K-234K Annually

Senior level

Computer Vision • Gaming • Software • Virtual Reality • Web3

As a Senior Software Engineer at Roblox, you'll focus on improving network reliability and efficiency by developing automation systems, managing network operations, and collaborating cross-functionally with infrastructure teams. You'll address technical challenges at scale and participate in an on-call rotation.

Top Benefits:

401-K

401-K Matching

Child Care Benefits

+46 More

Roblox

Senior SRE, Compute Orchestration

7 Days Ago

San Francisco Bay Area, CA

Hybrid

2,500 Employees

234K-284K Annually

Senior level

Apply

2,500 Employees

234K-284K Annually

Senior level

Computer Vision • Gaming • Software • Virtual Reality • Web3

As a Senior Site Reliability Engineer, you will build and support the infrastructure for Roblox's private cloud, focusing on orchestration systems, service discovery, and performance monitoring. You will automate processes, create fault-tolerant systems, and analyze system designs to ensure reliability and production readiness.

Top Benefits:

401-K

401-K Matching

Child Care Benefits

+46 More

Roblox

Engineering Manager/Senior Engineering Manager - Compute SRE

7 Days Ago

San Francisco Bay Area, CA

2,500 Employees

316K-384K Annually

Senior level

Apply

2,500 Employees

316K-384K Annually

Senior level

Computer Vision • Gaming • Software • Virtual Reality • Web3

The Engineering Manager/Senior Engineering Manager leads the Compute infrastructure team, enhancing system reliability and managing production health. Responsibilities include collaborating across functions, building robust infrastructure, and driving projects that improve scalability and performance. A successful candidate will have extensive experience in engineering management and a solid software engineering background.

Top Benefits:

401-K

401-K Matching

Child Care Benefits

+46 More

Roblox

Senior Software Engineer - Reliability Engine

7 Days Ago

San Francisco Bay Area, CA

2,500 Employees

219K-284K Annually

Senior level

Apply

2,500 Employees

219K-284K Annually

Senior level

Computer Vision • Gaming • Software • Virtual Reality • Web3

As a Senior Software Engineer on the Reliability team at Roblox, you will enhance engine stability and reliability by developing software solutions to monitor performance and crash metrics, mitigating incidents, and automating processes for improved reliability. You will work on various applications across platforms, collaborate with a team, and engage in a call rotation for incident management.

Top Benefits:

401-K

401-K Matching

Child Care Benefits

+46 More

Crusoe Energy Systems

Senior/Staff Site Reliability Engineer

7 Days Ago

San Francisco Bay Area, CA

450 Employees

180K-225K Annually

Senior level

Apply

450 Employees

180K-225K Annually

Senior level

Cloud • Greentech • Other • Energy

As a Senior Site Reliability Engineer at Crusoe, you will ensure the reliability and performance of infrastructure by detecting, analyzing, and preventing issues, while automating processes and collaborating with engineering teams. Your role includes monitoring system metrics, incident response, and driving continuous improvement based on customer needs.

Top Benefits:

401-K

401-K Matching

Commuter Benefits

+34 More

Webflow

Senior Site Reliability Engineer

8 Days Ago

San Francisco Bay Area, CA

Remote

200 Employees

Senior level

Easy Apply

200 Employees

Senior level

eCommerce • Software • Design • SEO

As a Senior Site Reliability Engineer at Webflow, you will enhance the reliability of customer-facing infrastructure, improve observability practices, optimize applications in Kubernetes, and work closely with various teams to ensure platform stability and security for millions of users.

Top Benefits:

401-K

Commuter Benefits

Company Equity

+36 More

Roblox

Network Reliability Systems - Principal Software Engineer

9 Days Ago

San Francisco Bay Area, CA

Hybrid

2,500 Employees

284K-332K Annually

Senior level

Apply

2,500 Employees

284K-332K Annually

Senior level

Computer Vision • Gaming • Software • Virtual Reality • Web3

As a Principal Software Engineer at Roblox, you will design automation and reliability systems for the global network infrastructure, lead projects, collaborate cross-functionally, and contribute to a strong engineering culture. Responsibilities include building cutting edge systems and participating in on-call rotations.

Top Benefits:

401-K

401-K Matching

Child Care Benefits

+46 More

Cisco Meraki

Lead Site Reliability Engineer - Remote

10 Days Ago

San Francisco Bay Area, CA

Remote

3,000 Employees

173K-242K Annually

Senior level

Easy Apply

3,000 Employees

173K-242K Annually

Senior level

Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI

The Lead Site Reliability Engineer will design, implement, and lead a highly available orchestration platform, influence architectural decisions focusing on security and performance, build documentation for automation and resiliency, and support cloud technologies. The role involves mentoring, leading projects, and participating in a 24x7 on-call rotation.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+88 More

Atlassian

Site Reliability Engineer

10 Days Ago

San Francisco Bay Area, CA

Remote

9,500 Employees

Junior

Apply

9,500 Employees

Junior

Cloud • Information Technology • Productivity • Security • Software

The Site Reliability Engineer will work with the SRE team to manage and improve caching infrastructure and automation for Atlassian's Cloud products. Responsibilities include ensuring high availability and scalability of services, debugging and improving code, and automating routine tasks while collaborating with various teams.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+58 More

Cisco Meraki

Site Reliability Engineer, FedRamp, Remote in the U.S.

12 Days Ago

San Francisco Bay Area, CA

Remote

3,000 Employees

95K-153K Annually

Mid level

Easy Apply

3,000 Employees

95K-153K Annually

Mid level

Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI

The Site Reliability Engineer will maintain the reliability, performance, and scalability of production systems, collaborating with various teams to ensure availability and compliance. Key responsibilities include implementing robust monitoring systems, participating in audits, ensuring industry best practices, and promoting automation processes.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+88 More

Cisco Meraki

Lead Site Reliability Engineer , Cloud Platform - Remote

12 Days Ago

San Francisco Bay Area, CA

Remote

3,000 Employees

173K-242K Annually

Senior level

Easy Apply

3,000 Employees

173K-242K Annually

Senior level

Hardware • Information Technology • Security • Software • Cybersecurity • Generative AI

As a Lead Site Reliability Engineer, you will design, develop, and operate a secure cloud platform. Responsibilities include enabling cloud adoption for teams, managing costs, building automated reporting capabilities, maintaining infrastructure as code, and collaborating on cloud strategy and compliance.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+88 More

Figure.ai

Senior Reliability Engineer

7 Days Ago

San Francisco Bay Area, CA

86 Employees

Senior level

Apply

86 Employees

Senior level

Artificial Intelligence • Robotics • Automation • Manufacturing

The Senior Reliability Engineer will design and execute test plans for humanoid robots, ensuring reliability and durability by analyzing failures and providing data-driven recommendations to design teams. Responsibilities include conducting accelerated life tests, collaborating with hardware engineers, documenting failures, and supporting failure analysis efforts.

Upstart

Senior Site Reliability Engineer

20 Days Ago

San Francisco Bay Area, CA

Remote

1,500 Employees

160K-222K Annually

Senior level

Easy Apply

1,500 Employees

160K-222K Annually

Senior level

Artificial Intelligence • Fintech • Machine Learning • Social Impact • Software

As a Senior Site Reliability Engineer at Upstart, you will enhance the reliability and performance of our production systems. You'll implement monitoring standards, improve incident response practices, and automate operations to support a high-quality customer experience. This role requires collaboration with teams to enhance system effectiveness.

Top Benefits:

401-K

401-K Matching

Adoption Assistance

+56 More