Invisible AI

Site Reliability Engineer (SRE)

Posted 18 Days Ago
Remote
110K-170K Annually
Senior level
As a Site Reliability Engineer, you will build and maintain scalable infrastructure, develop automation solutions, deploy applications, manage observability, and optimize Linux systems while collaborating with engineering teams.
The summary above was generated by AI

At Invisible AI, we are building the future of computer vision. Today, our core focus is developing an end-to-end platform that digitizes manufacturing operations. We deploy edge AI cameras to digitize every step of manual assembly work, which helps people-driven manufacturing be accurate, reliable, and safe. Coming from the world of self-driving cars, the founders of Invisible AI have years of experience building and deploying large-scale AI & Machine Learning pipelines. Join us and help build a company that will deliver the endless possibilities of computer vision to real-world customers!


As a Site Reliability Engineer, you will build the technology to enable our platform to deploy, run, and monitor Invisible AI’s software at scale across tens of independent deployments and thousands of devices. The SRE works closely with all other engineering teams and owns internal tools to enable faster development and deployment, like secure ephemeral debug environments, streamlined access controls, CI/CD systems, and a custom in-house device management platform for device configuration and software releases.

Responsibilities:

  • Design, build, and maintain scalable and resilient infrastructure on the edge.
  • Develop automation and infrastructure-as-code solutions using Terraform, Ansible, and scripting languages (Python, Bash).
  • Deploy and manage containerized applications using Docker and related technologies.
  • Ensure system observability by building and optimizing monitoring systems, particularly using Prometheus.
  • Troubleshoot and optimize Linux-based systems (e.g., Red Hat, CentOS, Ubuntu).
  • Collaborate with security teams to implement robust security measures and ensure compliance with best practices.
  • Work closely with software engineers to improve system performance, reliability, and deployment pipelines.
  • Support and maintain networking infrastructure, including troubleshooting protocols and configurations.
  • Manage cloud and on-premise infrastructure, with a focus on automation and scalability.
  • Contribute to incident response, postmortems, and process improvements.

Requirements:

  • 5+ years of experience in Site Reliability Engineering and building/managing infrastructure at scale, particularly on the edge.
  • Strong software development experience in one or more programming languages (e.g., Python, Go, Java).
  • Proficiency in Python, Docker, Linux systems, and shell scripting (Bash).
  • Strong expertise with infrastructure automation tools (Terraform, Ansible).
  • Experience managing observability and monitoring systems, particularly Prometheus.
  • Deep understanding of networking concepts and protocols.
  • Familiarity with cloud platforms (AWS, Azure, Google Cloud) is a plus.
  • Experience with Windows Services/VMs is a plus.
  • Excellent problem-solving skills, with attention to detail.
  • Strong communication and collaboration skills to work across teams.
  • Bachelor’s degree in Computer Science, Information Technology, or a related field, or equivalent experience.

Our compensation package plays a big part in how we value your impact on our mission. Base pay is one part of our total compensation package and is determined within a range, which provides the opportunity to progress as you grow and develop within a role. The estimated base salary guideline range for this role is $110,000 to $170,000 and may be modified. Actual base pay will vary based on several factors, including market and individual qualifications objectively assessed during the interview process. In addition to base salary, your compensation package will include additional components such as equity, sales incentive pay (for sales roles), and benefits.

Invisible AI is an equal-opportunity employer. We do not discriminate based on age, ethnicity, gender, nationality, religious belief, or sexual orientation.

Top Skills

Ansible
AWS
Azure
Docker
GCP
Linux
Prometheus
Python
Terraform
