Back to Jobs
Scale AI

Machine Learning Engineer, Model Evaluation

Scale AI
HybridFull Time$180K – $280K/yr🧠 Machine Learning
evaluationbenchmarkingred-teamingLLMAI safety

Job Description

Scale AI is looking for a Machine Learning Engineer to join our Model Evaluation team. You'll build systems and benchmarks to rigorously evaluate the capabilities and safety of frontier AI models for our enterprise and government clients.

You'll work on red-teaming, capability evaluations, and automated testing pipelines that help our clients understand what their models can and cannot do. This is a critical role in ensuring AI systems are deployed responsibly.

Requirements

  • 3+ years of ML engineering experience
  • Strong Python skills
  • Experience with LLM evaluation and benchmarking
  • Familiarity with statistical analysis
  • Understanding of AI safety and alignment concepts
  • Experience with data pipelines and annotation systems is a plus

Benefits

  • Competitive salary and equity
  • Comprehensive health benefits
  • 401(k) with matching
  • Flexible PTO
  • $2,000 annual learning stipend
  • Hybrid work flexibility

Job Details

Posted
April 5, 2026
Expires
May 5, 2026
Views
0
Applies
0

About the Company

Scale AI

Scale AI

San Francisco, CA

Scale AI accelerates the development of AI applications by providing high-quality training data and evaluation infrastructure.