Kentik is the network observability company. Our platform is a must-have for the network front line, whether digital business, corporate IT, or service provider. Network professionals turn to the Kentik Network Observability Cloud to plan, run, and fix any network, relying on our infinite granularity, AI-driven insights, and insanely fast search.
Kentik makes sense of network, cloud, host, and container flow, Internet routing, performance tests, and network metrics. We show network pros what they need to know about their network performance, health, and security to make their business-critical services shine. Networks power the world’s most valuable companies, and those companies trust Kentik. Market leaders like IBM, Box, and Zoom rely on Kentik for network observability. Visit us at kentik.com and follow us at @kentikinc.
What we do
Our platform ingests trillions of records and serves hundreds of thousands of queries for our users each day. You will gain experience building a production quality, high performance server-and-client SaaS application that handles uniquely high volumes of data.
We have built a team of world-class engineers, network experts, and technology thought leaders in a remote-friendly culture from day one. While prior experience in a remote environment is not required, we highly value strong collaboration and communication skills, as well as a high level of independence and autonomy.
*This is a remote role. However, due to the location of the teams we are hiring for, working hours in US time zones is a requirement for this position.
What you'll do
Kentik is looking for a Senior level Site Reliability Engineer (Cloud) to join our Product Engineering team. This person will help build and maintain our Synthetics and Cloud product lines. These products have multiple applications deployed in various cloud providers all over the world, and we manage these cloud applications using observability tooling, automated build processes, and adherence to configuration as code best practices.
We’re looking for an experienced engineer who will work with engineering teams across the company to help grow our hardware and software infrastructure. We operate a well-organized, well-instrumented platform, and offer enormous opportunities for employee growth.
- Ensure our real-time, scalable, infrastructure is set up for growth and working efficiently. Our infrastructure runs on our own hardware, across multiple locations as well as all major cloud vendors
- Work on tools and processes to better monitor our platform as well as ensuring its stability through our rapid growth
- Deep-dive into diverse topics, from firewalls and IP routing, to database replication strategies or automating build processes
- Collaborate with engineering and infrastructure teams on finding solutions from an operational perspective
- Assist with expanding our cloud deployments across the major cloud providers
- Contribute code, code reviews and tools or patches to all kinds of existing code
- Write design documents or collaborate on colleagues’ docs to introduce new features or changes into our infrastructure
- Provide valuable feedback on team goals, projects, and processes. We believe in continuously improving our team
- 5+ years of experience in cloud-based Systems Administration, IT and/or SRE related projects
- Strong experience with public cloud, container and orchestration technologies including AWS, GCP, Azure, Kubernetes, and Docker
- Solid programming and automation skills (Bash, Python, Go) including experience working with configuration management (infrastructure as code) platforms such as Terraform, Ansible, and Puppet
- Experience working with *nix system command line (e.g. ssh, grep, awk)
- Detailed understanding of major internet protocols (TCP/IP, DNS, HTTP, TLS)
- Networking administration experience: concepts such as routing, firewalls (iptables), peering sound familiar
- A passion for documenting code, processes, and infrastructure in runbooks and wikis
- Worked with metrics monitoring solutions such as grafana, prometheus, telegraf, and OpenTelemetry
- Experience creating and managing tickets with third party vendors and owning cloud vendor partner relationships
Nice to haves:
- Familiarity with Kubernetes orchestration and automation tools such as Helm, Kustomize, ArgoCD, and Flux
- Experience optimizing CI/CD pipelines such as GitHub Actions, Earthly and Jenkins
- Exposure to PagerDuty Integrations
- Knowledge of SRE, DevOps and GitOps practices and principles
What we offer
Kentik is a fully remote company that operates globally. We seek professionals that will help us thrive as an organization, and in turn, to broaden and enhance your career. We’re very thorough in the interview process to understand your skills and how they will relate to your successful growth here at Kentik. Our compensation philosophy encompasses a fair program for all in order to attract, engage and retain talented individuals who will drive our business and wow our customers.
The compensation range for this position is: $159,000 - $215,000. This range reflects the low and high end of the U.S. compensation range Kentik reasonably and generally expects to pay the hired candidate in this role. The actual compensation offered may be lower or higher than the stated range depending on various factors, including but not limited to:
- Experience with the skill sets required for success
- Demonstrated competencies and potential
- A geographic market-based approach
In addition to a great career opportunity, Kentik offers stellar benefits for our employees, which include:
- 100% of premiums are paid by company for health, vision and dental coverage for you and your dependents
- Additionally, an annual Health Reimbursement Account (HRA) of $3,000 for an individual or $4,500 for a family
- Paid family & medical leave
- Open PTO, a quarterly Wellness Day, and a minimum of 10 paid holidays
- 401(k) retirement account
- Home office reimbursement
- Stock options
Note: Benefits are as listed for all US full-time employees. For compensation, international applicants will be treated equitably in relation to the laws applicable within the countries in which we operate.
The true meaning of Kentik is visibility. We’re committed to making sure everyone feels empowered to use their voice, has a sense of belonging, and is represented at Kentik.
We don’t look for individuals who fit the culture, but those who will continue to add to the culture.
We encourage everyone to apply, especially those individuals who are underrepresented in the industry: people of color, LGBTQI+ community, women, individuals with disabilities (both seen and unseen), veterans, and people of any age or family status.
Kentik is committed to creating an inclusive interview process. If you require a reasonable accommodation during the application or interview process, please reach out to recruiting@kentik.com.
Come as you are!
You will be working at a fast-growing, well-funded startup alongside industry thought leaders and network aficionados as we build the future of observability and set the high bar for how network operations and digital businesses should run. With a competitive salary and amazing benefits on top of the meaningful and challenging projects you’ll take on, we’re sure you’ll enjoy joining the Kentik team.
#li-remote
Top Skills
Kentik San Francisco, California, USA Office
548 Market St, Pmb 78595, San Francisco, CA, United States, 94104
Similar Jobs
What you need to know about the San Francisco Tech Scene
Key Facts About San Francisco Tech
- Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
- Major Tech Employers: Google, Apple, Salesforce, Meta
- Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
- Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
- Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
- Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine