Cartesia

Research Engineer

Posted Yesterday
San Francisco, CA
Senior level
The Research Engineer will improve pretrained models' quality and efficiency, implement new architectures, build training infrastructure, and keep up with new research.

About Cartesia

Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text—1B text tokens, 10B audio tokens and 1T video tokens—let alone do this on-device.

We're pioneering the model architectures that will make this possible. Our founding team met as PhDs at the Stanford AI Lab, where we invented State Space Models (SSMs), a new primitive for training large-scale foundation models. We pair deep expertise in model innovation and systems engineering with a design-minded product engineering team to build and ship cutting-edge models and experiences.

We're funded by leading investors at Index Ventures and Lightspeed Venture Partners, along with Factory, Conviction, A Star, General Catalyst, SV Angel, Databricks and others. We're fortunate to have the support of many amazing advisors, and 90+ angels across many industries, including the world's foremost experts in AI.

Role responsibilities

Your main responsibility will be to push the quality, efficiency and capabilities of our pretrained models, in collaboration with a variety of machine learning, data and systems engineering stakeholders.

  • implement new model backbones, architectures and training algorithms,

  • rapidly run and iterate on experiments and ablations,

  • build training infrastructure that scales to massive multimodal datasets,

  • stay up to date on new research ideas.

What we’re looking for

Given the scale and difficulty of problems we work on, we value strong engineering skills at Cartesia.

  • Strong engineering skills, comfortable navigating complex codebases and monorepos.

  • Deep machine learning background, including a strong grasp of fundamentals in sequence modeling, generative models and common model architecture families (RNNs, CNNs, Transformers).

  • Experience training models, ideally including writing and pretraining large-scale models.

  • Proficient in Python and PyTorch (or a similar framework) and tensor programming more broadly.

  • Familiarity with efficiency tradeoffs in designing model architectures for accelerators such as GPUs.

  • At least 5 years of experience implementing and training models, including any time spent on advanced degrees (MS/PhD).

  • [bonus] Prior research experience in advancing state space models or implementing them in practice.

  • [bonus] Experience in optimizing model inference with CUDA, Triton or other frameworks.

Even if you don’t meet every requirement above, we'd encourage you to apply.

Our culture

🏢 We’re an in-person team based out of San Francisco. We love being in the office, hanging out together and learning from each other every day.

🚢 We ship fast. All of our work is novel and cutting edge, and execution speed is paramount. We have a high bar, and we don’t sacrifice quality and design along the way.

🤝 We support each other. We have an open and inclusive culture that’s focused on giving everyone the resources they need to succeed.

Our perks

🍽 Lunch, dinner and snacks at the office.

🏥 Fully covered medical, dental, and vision insurance for employees.

🏦 401(k).

✈️ Relocation and immigration support.

🦖 Your own personal Yoshi.

Top Skills

CUDA
Python
PyTorch

What you need to know about the San Francisco Tech Scene

San Francisco and the surrounding Bay Area attract more startup funding than any other region in the world. Home to Stanford University and UC Berkeley, leading VC firms and several of the world’s most valuable companies, the Bay Area is the place to go for anyone looking to make it big in the tech industry. That said, San Francisco has a lot to offer beyond technology thanks to a thriving art and music scene, excellent food and a short drive to several of the country’s most beautiful recreational areas.

Key Facts About San Francisco Tech

  • Number of Tech Workers: 365,500; 13.9% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Google, Apple, Salesforce, Meta
  • Key Industries: Artificial intelligence, cloud computing, fintech, consumer technology, software
  • Funding Landscape: $50.5 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Sequoia Capital, Andreessen Horowitz, Bessemer Venture Partners, Greylock Partners, Khosla Ventures, Kleiner Perkins
  • Research Centers and Universities: Stanford University; University of California, Berkeley; University of San Francisco; Santa Clara University; Ames Research Center; Center for AI Safety; California Institute for Regenerative Medicine