AI Engineer, Large Scale Model Evaluation

Posted 2 Days Ago
Sunnyvale, CA
Entry level
Artificial Intelligence • Robotics • Automation • Manufacturing
The Role
As a Model Evaluator at Figure, you will lead user studies and data collection for AI model evaluations, focusing on developing methodologies for assessing model performance, collaborating with teams to create evaluation frameworks, and implementing monitoring pipelines for continuous evaluation.
Summary Generated by Built In

Figure is an AI Robotics company developing a general purpose humanoid. Our humanoid robot, Figure 02, is designed for commercial tasks and the home. We are based in Sunnyvale, CA and require 5 days/week in-office collaboration. It’s time to build.

Figure’s vision is to deploy autonomous humanoids at a global scale. Our AI team is looking for a Model Evaluator to take our learned robot models to the next level. As a key member of our AI team, you will be responsible for leading user studies, data collection efforts, and evaluations for AI models across multiple modalities. Your work will directly impact robots that we ship into the real world to perform useful work.

Responsibilities: 

  • Evaluate Model Performance: Develop, implement, and refine rigorous methodologies for assessing AI model accuracy, robustness, and efficiency across multiple modalities (e.g. vision, language, proprioception)
  • Framework & Tooling Development: Collaborate with internal teams to build or integrate new evaluation frameworks, simulation environments, and metrics tailored to our humanoid robot applications
  • Baseline & Benchmark Creation: Establish and maintain benchmarks, gold-standard datasets, and systematic test procedures for continued performance comparisons of new and existing models
  • Continuous Evaluation & Monitoring: Implement ongoing monitoring pipelines to detect model drift or performance degradation, and propose retraining strategies when necessary
  • Collaboration with Engineering Teams: Work closely with roboticists, software engineers, and data scientists to ensure end-to-end integration of model evaluation feedback into production systems

Requirements: 

  • The ideal candidate will have a strong computer science background, excellent attention to detail, and a passion to make an impact
  • Track record of building and maintaining distributed systems
  • Excellent communication skills
  • Ability to thrive in a fast-paced environment, where solutions are often unclear and require exploration

Bonus Qualifications: 

  • Prior experience working with robotic learning systems or large generative models


The US base salary range for this full-time position is $150,000 to $275,000 annually.

The pay offered for this position may vary based on several individual factors, including job-related knowledge, skills, and experience. The total compensation package may also include additional components/benefits depending on the specific role. This information will be shared if an employment offer is extended. 


Top Skills

AI
Robotics
The Company
86 Employees
On-site Workplace
Year Founded: 2022

What We Do

Figure is an AI Robotics company building the world's first commercially viable autonomous humanoid robot. We are based in Sunnyvale, CA.
