Manager, Site Reliability Engineering

Posted 5 Days Ago
Easy Apply
Be an Early Applicant
Palo Alto, CA
Hybrid
146K-255K Annually
Senior level
Fintech • Information Technology • Payments • Productivity • Software • Travel • Automation
Travel & expense made easy.
The Role
As a Site Reliability Engineering (SRE) Manager at Navan, you will lead SRE teams to design, implement, and automate scalable infrastructure. You'll collaborate with engineering and product teams, improve incident response processes, define service-level objectives, and ensure 24x7 system availability using AWS and observability tools.
Summary Generated by Built In

At Navan, “It’s all about the user. All of them.” We’re passionate about providing a seamless one-stop experience for business travelers, no matter how they travel, where they stay, or where they’re going. We are committed to building the most reliable, scalable, and efficient infrastructure to ensure our services are always available when travelers need them most. With our rapid growth, we face exciting challenges ahead and are seeking a Site Reliability Engineering (SRE) Manager to join our team in headquarters based out of Palo Alto, California.

As a SRE Manager, you will lead a team of senior and experienced SREs, driving innovation in infrastructure design, automation, and tooling. You will spearhead the development of infrastructure services that power Navan’s systems, serving thousands of travelers daily. Your role will include partnering with development, release and productivity, and security teams to identify user needs and deliver cutting-edge solutions.

You will oversee a diverse range of systems and technologies with the goal of building autonomous, fault-tolerant, and monitored infrastructure. This infrastructure will be optimized for simplicity, performance, and uptime. Collaborating with backend and frontend engineering teams, you will ensure that our systems are scalable, reliable, and efficient. Additionally, you will lead efforts to design and implement infrastructure capable of supporting our exponential growth while maintaining the highest levels of service reliability and operational excellence. 

What You'll Do

  • Lead & Mentor the SRE Team: Guide and develop a high-performing team of SREs, fostering a culture of collaboration, reliability, and continuous improvement.
  • Drive Infrastructure Reliability & Automation: Collaborate with Engineering and Product teams to design and implement scalable, fault-tolerant systems. Leverage IaC tools (e.g., Terraform, CloudFormation) and microservices architectures to automate and improve infrastructure.
  • Incident Management: Improve incident response processes, reduce MTTR, and proactively mitigate risks. Apply resiliency patterns to ensure systems are fault-tolerant and highly available.
  • Define & Measure SLOs: Develop service-level objectives (SLOs) and KPIs to track and improve system reliability, using tools like NewRelic or DataDog for observability.
  • 24x7 Production Support: Ensure system availability in a 24x7 environment, applying expertise in AWS (e.g., ECS, Lambda, DynamoDB) and database management for optimal performance.
  • Optimize CI/CD Pipelines: Automate and streamline deployment workflows using tools like Jenkins or GitHub Actions to ensure faster and more reliable deployments.
  • Resource Management: Manage team resources, including capacity planning, hiring, and upskilling, to meet evolving business needs.

What We're Looking For

  • 8+ years in Site Reliability Engineering, DevOps, or Infrastructure roles, with at least 3 years in a leadership position.
  • Proven ability to lead and mentor teams, fostering a culture of collaboration and reliability.
  • Hands-on experience with AWS cloud technologies, Infrastructure as Code (Terraform/CloudFormation), microservices architectures, deployment automation (Jenkins/GitHub Actions), and observability tools (NewRelic/DataDog).
  • Strong background in designing scalable, fault-tolerant systems, improving incident response, and driving operational improvements.
  • Excellent interpersonal and communication skills, with the ability to work effectively across cross-functional teams.

The posted pay range represents the anticipated low and high end of the compensation for this position and is subject to change based on business need. To determine a successful candidate’s starting pay, we carefully consider a variety of factors, including primary work location, an evaluation of the candidate’s skills and experience, market demands, and internal parity.
For roles with on-target-earnings (OTE), the pay range includes both base salary and target incentive compensation. Target incentive compensation for some roles may include a ramping draw period. Compensation is higher for those who exceed targets. Candidates may receive more information from the recruiter.

Pay Range

$146,250$255,000 USD

Top Skills

AWS
CloudFormation
Github Actions
Jenkins
Terraform

What the Team is Saying

Anna
Brian
Roshni
Adamas Victória
Jordan
The Company
Palo Alto, CA
3,000 Employees
Hybrid Workplace
Year Founded: 2015

What We Do

Navan is the all-in-one super app that makes travel and expense easy so you can focus on being there, not getting there. Say goodbye to spending hours on the phone trying to change your flight or saving stacks of receipts to manually input expenses. From EAs and finance teams to travel managers and employees, Navan empowers people to focus on the things that matter most to them — all while providing companies with real-time visibility, savings, and control.

Navan’s investors include visionaries like Andreessen Horowitz, Lightspeed Ventures, Greenoaks, Zeev Ventures, and entrepreneurs Lee Fixel, Adam Bain, and Elad Gil. In Oct 2022, Navan announced its Series G upround at a post-money valuation of $9.2B to help accelerate future growth plans.

In April 2023, Navan expanded in the Indian market with the acquisition of Tripeur, a modern, people-centric corporate travel and expense management company. The group’s fifth acquisition in under two years, Tripeur joined the Navan Group alongside Spanish meetings and events specialists, Atlanta Events & Corporate Travel Consultants; Berlin-based modern travel management company, Comtravo; leading Scandinavian travel agency Resia AB; and London-based high-touch TMC, Reed & Mackay.

Why Work With Us

At Navan, we’re never satisfied with the status quo, and we know breakthrough ideas come from diverse perspectives. We are committed to cultivating a workplace that reflects the diversity of the customers we serve while fostering leadership and innovation.

Gallery

Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery
Gallery

Navan Offices

Hybrid Workspace

Employees engage in a combination of remote and on-site work.

In-person connections is the foundation of Navan, the connections forged through face-to-face interactions improve company culture and what we can achieve together. We operate on a hybrid working model, which we define as three days a week in-office.

Typical time on-site: 3 days a week
Palo Alto, CA

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account