AIML - Sr Data Engineer - Siri, Data and ML Innovation

Sorry, this job was removed Sorry, this job was removed at 02:53 p.m. (PST) on Wednesday, Aug 21, 2024
Be an Early Applicant
Cupertino, CA
Hardware • Retail • Software • Wearables
The Role

Summary

The AIML Data organization seeks to improve products by using data as the voice of our customers. Within this organization, the Siri Data Engineering team builds systems that process data reliably at scale to generate scalable and high-quality datasets that support confident, data informed decision-making for improving understanding capabilities of Siri Assistant.

We're looking for exceptional data engineers who are passionate about our product and values; who love working with data at scale, and who are committed to continuously improve. As a part of this group, you will work with petabytes of data daily using diverse technologies. You will be expected to effectively partner with upstream engineering teams and downstream consumers, including data scientists and ML engineers.

Key Qualifications

7+ years of technical experience designing, building, and maintaining distributed data processing platforms.

5+ years of industry experience working with batch or streaming distributed data processing technologies (e.g. Spark, Flink, Kafka, Presto, Hadoop, MapReduce, etc.) for building efficient & large-scale data pipelines.

3+ years of data modeling experience designing data warehouse table schemas and logging schemas.

Proficiency in at least one high-level programming language (Python, Java, Scala, Go or equivalent).

Experience with large, complex, highly dimensional data sets; hands-on experience with SQL.

Experience working with cross-functional teams to collect business requirements, build consensus, and manage expectations

You are self-directed and capable of operating amidst ambiguity.

You are humble, continually growing in self-awareness, and possessing a growth mindset.

You are curious and have excellent written and verbal communication as well as problem-solving skills.

You are excited about digging into massive petabyte-scale semi-structured datasets.

Description

In this role, you will be building ultra large scale batch & streaming datasets to support analytics, experimentation and machine learning and helping to drive our self-serve strategy for reporting on-behalf of data scientists and product engineers as we collectively make product better.

You will help design instrumentation required to log data from device and server side and validate data is flowing in the correct shape, frequency, and quality into the Data Warehouse. Curate a high performance and easy to understand data model that meets the needs of the many.

Identify common patterns and build self-serve tools to scale data engineering, and automate lifecycle of datasets with the highest standards of data quality.

Educate your consumers on how to access your products, assuring transparency and understanding in logic definitions and enabling self-service.

Education & Experience

Surprise us! Many will have an MS or BS in CS, Engineering, Math, Statistics, or a related field or equivalent practical experience in data engineering.

The Company
Cupertino
165,000 Employees
On-site Workplace
Year Founded: 1976

What We Do

We’re a diverse collective of thinkers and doers, continually reimagining what’s possible to help us all do what we love in new ways. The people who work here have reinvented entire industries with the Mac, iPhone, iPad, and Apple Watch, as well as with services, including Apple TV, the App Store, Apple Music, and Apple Pay. And the same innovation

Gallery

Gallery

Similar Jobs

Anduril Logo Anduril

Senior Data Engineer

Aerospace • Artificial Intelligence • Hardware • Robotics • Security • Software • Defense
Costa Mesa, CA, USA
4500 Employees
150K-225K Annually

Flawless (flawlessai.com) Logo Flawless (flawlessai.com)

Data Engineer

Artificial Intelligence • Software
Hybrid
90401, Santa Monica, CA, USA
224 Employees
160K-185K Annually
San Francisco, CA, USA
4 Employees
150K-200K Annually

Flawless (flawlessai.com) Logo Flawless (flawlessai.com)

Senior Data Engineer

Artificial Intelligence • Software
Hybrid
90401, Santa Monica, CA, USA
224 Employees
180K-225K Annually

Similar Companies Hiring

Hedra Thumbnail
Software • News + Entertainment • Marketing Tech • Generative AI • Enterprise Web • Digital Media • Consumer Web
San Francisco, CA
14 Employees
RunPod Thumbnail
Software • Infrastructure as a Service (IaaS) • Cloud • Artificial Intelligence
San Francisco, CA
53 Employees
Altana Thumbnail
Software • Machine Learning • Artificial Intelligence
San Francisco, CA
200 Employees
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account