MatX Logo

MatX

ML Performance Engineer

Job Posted 8 Days Ago Reposted 8 Days Ago
Mountain View, CA
Mid level
Mountain View, CA
Mid level
The ML Performance Engineer develops performance models, writes production libraries for ML, and collaborates on solutions from model to hardware.
The summary above was generated by AI

MatX's mission is to make the world’s best AI models run as efficiently as allowed by physics, bringing the world years ahead in AI quality and availability. We are developing vertically integrated full-stack solutions from silicon to systems including hardware and software to train and run the largest ML workloads for AGI. We are looking for people who are excited about systems-focused ML research.

Responsibilities include:

  • Build performance models and tooling to validate and guide scheduling decisions for current and future ML models.
  • Write production-grade libraries for efficient distributed training and serving.
  • Collaborate with architects, hardware, and software teams to drive solutions from model to metal.

Requirements:

  • Bachelor's or Master's degree in Computer Science, Engineering, or a related field
  • Strong programming skills in Python
  • Expertise in ML frameworks such as JAX, PyTorch, or Tensorflow.
  • Deep knowledge of the Transformer architecture.
  • Experience with distributed computing, high performance networking, or large-scale ML systems,

Preferred Skills: 
Any of the following:

  • Hands on experience with flash attention, quantization, pruning, or other systems performance optimizations.
  • Fluency in parallelism strategies that balance computation, communication, and memory to improve throughput and latency of large models.
  • Experience with performance analysis tools and profilers for large scale systems.
  • Solid understanding of computer architecture and low-level optimization techniques
  • A track record of impactful ML Systems Research, through publication and/or industrial practice.
  • Experiences in pre-silicon exploration, post-silicon bringup, and fleet debug.

Compensation: The US base salary for this full-time position is $120,000 - $400,000 + equity + benefits

As part of our dedication to the diversity of our team and our focus on creating an inviting and inclusive work experience, MatX is committed to a policy of Equal Employment Opportunity and will not discriminate against an applicant or employee on the basis of race, color, religion, creed, national origin or ancestry, sex, gender, gender identity, gender expression, sexual orientation, age, physical or mental disability, medical condition, marital/domestic partner status, military and veteran status, genetic information or any other legally recognized protected basis under federal, state or local laws, regulations or ordinances.

All candidates must be authorized to work in the United States and work from our offices in Mountain View Tuesdays-Thursdays.

This position requires access to information that is subject to U.S. export controls. This offer of employment is contingent upon the applicants capacity to perform job functions in compliance with U.S. export control laws without obtaining a license from U.S. export control authorities.


MatX does not accept unsolicited resumes from individual recruiters or third-party recruiting agencies in response to job postings. No fee will be paid to third parties who submit unsolicited candidates directly to our hiring managers or People team and any resumes submitted are deemed to be the property of MatX.

Top Skills

Jax
Python
PyTorch
TensorFlow
HQ

MatX Mountain View, California, USA Office

Mountain View, CA, United States

Similar Jobs

8 Days Ago
San Francisco, CA, USA
Mid level
Mid level
Software
As a Senior Machine Learning Engineer, you'll explore AI techniques, optimize model performance, and collaborate with teams to develop software solutions.
Top Skills: CudaJupyterPythonPyTorch
8 Days Ago
San Francisco, CA, USA
192K-260K Annually
Senior level
192K-260K Annually
Senior level
Big Data • Machine Learning • Software • Analytics • Big Data Analytics
The role involves analyzing performance bottlenecks in ML training, developing tools for performance profiling, and collaborating with researchers on efficiency methods.
Top Skills: CudaCudnnCutlassEigenMklPyTorchTensorFlow
14 Days Ago
Santa Clara, CA, USA
Junior
Junior
Automotive
Design and optimize onboard software systems for deploying large deep learning models in production vehicles, enhancing performance, power efficiency, and latency.
Top Skills: C++CudaNvidia TensorrtPython

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attracts more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine
By clicking Apply you agree to share your profile information with the hiring company.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account