Staff Site Reliability Engineer (SRE)

Sorry, this job was removed at 03:31 p.m. (PST) on Friday, Aug 02, 2024
Be an Early Applicant
San Francisco, CA
Hybrid
170K-200K Annually
7+ Years Experience
Wearables
The Role

About Augmedix:


Augmedix (Nasdaq: AUGX) delivers industry-leading, ambient medical documentation and data solutions to healthcare systems, physician practices, hospitals, and telemedicine practitioners.

 

Augmedix is on a mission to help clinicians and patients form a human connection by seamlessly integrating our technology at the point of care. Augmedix’s proprietary platform digitizes natural clinician-patient conversations, which are converted into comprehensive medical notes and structured data in real time. The company’s platform uses automatic speech recognition, and natural language processing, including large language models, to generate accurate and timely medical notes that are transferred into the EHR. 

 

Augmedix’s products relieve clinicians of administrative burden, in turn, reducing burnout, increasing clinician efficiency and improving patient access. Through Augmedix’s proprietary platform and bi-directional communication channel, Augmedix is ideally suited to serve as the vehicle for change at the point of care.

 

Augmedix is headquartered in San Francisco, CA, with offices around the world. To learn more, visit www.augmedix.com.


About the Role:


We are seeking a highly skilled and experienced Staff SRE to join our growing team. You will play a critical role in ensuring the reliability, scalability, and performance of our critical infrastructure and applications. Beyond core SRE responsibilities, you will also serve as a key liaison across various teams, fostering collaboration and ensuring seamless operations.

Responsibilities:

  • Proactively identify and mitigate potential issues impacting infrastructure and applications
  • Partner with development teams to implement best practices for building reliable and scalable systems
  • Stay up-to-date on the latest SRE trends and technologies

  • Monitoring and Observability:

  • Design, implement, and maintain robust monitoring solutions using tools like Prometheus and Grafana
  • Develop and configure alerts within tools like PagerDuty to ensure timely notification of potential issues
  • Analyze and troubleshoot issues using collected application and infrastructure metrics

  • Incident Management:

  • Lead incident response, ensuring timely resolution and minimizing downtime
  • Document and communicate incident details effectively to stakeholders
  • Conduct post-incident reviews to identify root causes and implement preventative measures

  • Service Level Agreements (SLAs):

  • Collaborate with product and engineering teams to define clear and measurable SLAs for our SaaS offerings
  • Establish Service Level Objectives (SLOs) for key metrics based on SLA requirements
  • Define Service Level Indicators (SLIs) to track progress towards achieving SLOs
  • Monitor SLO compliance and proactively identify potential SLA breaches

  • Automation:

  • Identify opportunities for automation to improve efficiency and reliability
  • Develop and implement automation scripts using tools like Python or Bash
  • Automate routine tasks and incident response workflows

  • Cross-Team Collaboration:

  • Act as a liaison between SRE, Product, Security, Application Engineering, and Customer Operations teams
  • Facilitate communication and information sharing across teams to ensure smooth operations
  • Work collaboratively to define and implement solutions that meet the needs of all stakeholders

  • Mentorship and Knowledge Sharing:

  • Mentor and collaborate with junior SRE engineers
  • Share knowledge and best practices within the team
  • Contribute to the development and documentation of internal SRE processes

Requirements:

  • 5-8 years of experience as a Site Reliability Engineer (SRE) or related role
  • Proven experience with monitoring tools like Prometheus and Grafana
  • Strong understanding of incident management best practices
  • Experience with alerting tools like PagerDuty
  • Experience with scripting languages like Python or Bash for automation
  • Excellent communication and collaboration skills
  • Ability to work independently and as part of a team
  • Strong problem-solving and analytical skills
  • Passion for building reliable and scalable systems

Bonus:

  • Experience with cloud platforms like AWS, GCP, or Azure
  • Experience with container orchestration platforms like Kubernetes
  • Experience with chaos engineering principles
  • Experience with configuration management tools like Ansible or Chef

Augmedix is an equal opportunity employer. We are committed to providing equal employment opportunities regardless of sex, gender identity, race, religious creed, color, ancestry, age, disability, marital status, sexual orientation including being transgender and/or any other protected bases.

The Company
San Francisco, CA
417 Employees
On-site Workplace
Year Founded: 2012

What We Do

Augmedix is a telemedicine charting service for healthcare providers that eliminates the 17 hours each week providers spend on EHR documentation. Using wearable technology to connect their clinic with the Augmedix charting service, providers can focus on what they do best: taking care of patients.

Jobs at Similar Companies

News 12 Logo News 12

Longform Writer / Producer

Consumer Web • Digital Media • News + Entertainment
Hybrid
Bronx, NY, USA
720 Employees
64K-106K Annually

Capital One Logo Capital One

Associate, Process Manager - New Grad 2025

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
Toronto, ON, CAN
55000 Employees

Capital One Logo Capital One

Intern, Process Management - Summer 2025

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
Toronto, ON, CAN
55000 Employees

Capital One Logo Capital One

Manager, Process Management - Customer Resiliency

Fintech • Machine Learning • Payments • Software • Financial Services
Hybrid
Toronto, ON, CAN
55000 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account