WitnessAI

Site Reliability Engineer - Platform Engineering

Posted 6 Days Ago
7 Locations
170K-200K Annually
Mid level
As a Site Reliability Engineer, you will be responsible for maintaining system reliability, managing infrastructure on AWS, and optimizing automation processes. You will collaborate with engineering teams to enhance system performance, troubleshoot incidents, and implement best practices in security and compliance.
The summary above was generated by AI

Job Title: Site Reliability Engineer (SRE), Platform Engineering

About Us: WitnessAI is a leader in providing innovative networking solutions designed to enhance security, performance, and reliability for businesses of all sizes. We are seeking a highly skilled Site Reliability Engineer (SRE) with a strong background in Linux administration, AWS, and Kubernetes for our Platform Engineering team. The ideal candidate will help ensure the reliability, scalability, and performance of our systems while driving a culture of automation and continuous improvement.

Key Responsibilities

System Reliability & Operations

  • Maintain and improve the reliability, availability, and performance of our services and infrastructure.

  • Monitor system health, troubleshoot issues, and respond to incidents with a focus on reducing mean time to recovery (MTTR).

Infrastructure Management

  • Administer and optimize Linux-based systems across development, staging, and production environments.

  • Design and manage scalable, secure, and cost-effective solutions on AWS.

  • Build, maintain, and monitor Kubernetes clusters to support containerized applications.

Automation & Tooling

  • Develop and maintain CI/CD pipelines to streamline deployments.

  • Automate operational tasks using tools such as Terraform, Crossplane, or custom scripts.

  • Create and enhance monitoring, alerting, and logging systems to improve observability.

  • Build ad-hoc, reusable automation solutions where required.

Collaboration & Best Practices 

  • Partner with engineering teams to integrate SRE principles into the software development lifecycle.

  • Advocate for best practices in incident response, post-mortem reviews, and capacity planning.

  • Share knowledge with team members and contribute to a culture of continuous improvement.

Security & Compliance

  • Implement security best practices for cloud and containerized environments.

  • Ensure compliance with organizational and industry standards.

Requirements

Technical Skills

  • Proven expertise in Linux system administration (e.g., Ubuntu, CentOS, or similar).

  • Deep understanding of AWS services and architecture (e.g., EC2, S3, RDS, VPC, IAM).

  • Strong experience managing Kubernetes clusters in production.

  • Hands-on experience with infrastructure-as-code tools like Terraform or CloudFormation.

  • Proficiency in scripting or programming languages (e.g., Python, Bash, or Go).

  • Demonstrated experience in application development for backend automation solutions.

  • 3+ years of experience as a Site Reliability Engineer, DevOps Engineer, or in a similar role at a SaaS or cloud-based company.

Operational Expertise

  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, ELK, or Datadog.

  • Experience designing and maintaining CI/CD pipelines (e.g., Jenkins, GitLab CI, or CircleCI).

  • Understanding of networking concepts (e.g., DNS, load balancing, firewalls).

Problem Solving & Collaboration

  • Strong analytical and troubleshooting skills.

  • Ability to work effectively in a collaborative, team-oriented environment.

  • Excellent written and verbal communication skills.

Education

Bachelor’s degree in Computer Science, Engineering, or equivalent work experience.

Nice-to-Have Skills:

  • Experience with service meshes and other CNCF technologies (e.g., Istio or Linkerd).

  • Knowledge of database systems (e.g., MySQL, PostgreSQL, or NoSQL databases).

  • Familiarity with cloud-native technologies and tools (e.g., Helm, ArgoCD, Spinnaker).

Benefits:

  • Hybrid work environment.

  • Competitive salary.

  • Health, dental, and vision insurance.

  • 401(k) plan.

  • Opportunities for professional development and growth.

  • Generous vacation policy.

Salary range:

$170,000-$200,000

Top Skills

AWS
Bash
CircleCI
Datadog
ELK
GitLab CI
Go
Grafana
Jenkins
Kubernetes
Linux
Prometheus
Python
Terraform
HQ

WitnessAI San Mateo, California, USA Office

San Mateo, CA, United States, 94403

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine