Software Development Engineer - SRE

Sorry, this job was removed at 02:52 p.m. (PST) on Wednesday, Aug 21, 2024
Be an Early Applicant
Cupertino, CA
5-7 Years Experience
Hardware • Retail • Software • Wearables
The Role

Summary

People at Apple don't just build products - they craft experiences our customers love and depend on. Apple Services Engineering (ASE) builds and supports the systems that make many of these daily experiences possible. If you've used Apple products, you've likely interacted with us. iCloud Services SRE teams are responsible for the systems and services that directly support those customers and their experiences. We focus on availability and automation of key services that run iCloud every minute of every day all around the world.

Key Qualifications

Experience with large scale distributed systems, especially ML infrastructure and services including LLMs, Generative AI, and transformers

Knowledge of core operating system principles, networking fundamentals, and systems management

Demonstrable fluency in at least one of Java, Python, Swift, Rust or GoLang

Awareness of key security principles including encryption, keys (types and exchange protocols)

Understanding of SRE principals including monitoring, alerting, error budgets, fault analysis, and automation

Strong sense of ownership, with a desire to communicate and collaborate with other engineers and teams

Ability to succinctly identify and communication technical and architectural problems, while working with partners and their team to iteratively find solutions

Description

We are looking for an SRE with experience building and supporting machine learning (ML) infrastructure. You will apply SRE best practices to ensure the availability, reliability, and performance of our ML systems and services. You will actively engage with our development partners and product teams regularly so the ML services we well aligned with business needs.

If you love designing and running systems and infrastructure that will delight millions of customers this team is for you!

Responsibilities will include:

Support and maintain ML services by measuring and monitoring availability, latency, and overall system health

Deploy and support existing and new ML models and infrastructure

Provide insights to partner stakeholders through log and telemetry analysis

Maintaining documentation and automating manual processes where possible

Be part of an oncall rotation providing hands-on technical expertise during service impacting events

Collaborate with other engineers on code, infrastructure, and design reviews, and process enhancements

Education & Experience

BS degree in computer science or equivalent field with 5+ years of experience

The Company
Cupertino
165,000 Employees
On-site Workplace
Year Founded: 1976

What We Do

We’re a diverse collective of thinkers and doers, continually reimagining what’s possible to help us all do what we love in new ways. The people who work here have reinvented entire industries with the Mac, iPhone, iPad, and Apple Watch, as well as with services, including Apple TV, the App Store, Apple Music, and Apple Pay. And the same innovation

Gallery

Gallery

Similar Companies Hiring

Monte Carlo Thumbnail
Software • Cloud • Big Data Analytics • Big Data
San Francisco, CA
165 Employees
Headway Thumbnail
Software • Social Impact • Professional Services • Healthtech • Consumer Web
San Francisco, CA
504 Employees
Resident Thumbnail
Retail • Manufacturing • eCommerce
San Francisco, CA
322 Employees

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account