Remote Opportunity

Senior Applied Scientist - AI Evaluation & Quality Systems

Join Apple as a senior professional working remotely from Worldwide. Explore the role, benefits, and apply in one place.

Full Time
$120,000 - $180,000*
10 hours ago
Worldwide
AI Governance & Programs
Senior
Python
Machine Learning
Large Language Models
+4 more

Job Description

Apple Services Engineering (ASE) powers the AI and LLM features behind experiences that hundreds of millions of users love every day. As these systems increasingly rely on human-in-the-loop evaluation, the quality of our products is directly constrained by the quality of our evaluation systems. We believe that to build exceptional AI, you need exceptional mechanisms to validate the signals used to train and evaluate them. DESCRIPTION The Human-centered AI, Data Quality Operations team is looking for a Senior Applied Scientist to join our growing team. We are building the systems and methodologies that make AI evaluation trustworthy, and scalable — directly shaping how Apple develops and validates AI across products and services. In this role, you will develop novel, scalable quality control solutions, working closely with cross-functional teams to ensure the data powering our AI/ML systems meets the highest standards of accuracy, consistency, and relevance. Your work will span two connected problem spaces. The first is the methodology and tooling that generates reliable ground truth and detects quality failures across human annotation and automated evaluation pipelines. The second is the autonomous QA agents that make those methodologies generalizable across teams and use cases. This role demands fluency across research thinking and engineering execution — you will prototype, validate, and ship. A strong point of view on when not to use a model or agent is as valued here as the ability to build one. MINIMUM QUALIFICATIONS 5+ years of industry experience in applied science or machine learning with demonstrated impact on shipped systems Strong hands-on experience with Large Language Models including prompt engineering and applied use cases such as grading, validation, or classification Strong working knowledge of evaluation methodology for generative AI, including LLM-as-a-judge design, meta-evaluation, and failure mode analysis Familiarity with human-in-the-loop evaluation systems and the operational dynamics that affect data quality at scale Hands-on experience designing ground truth generation pipelines across varied task types and annotation modalities Proficiency in Python and relevant ML frameworks, with production experience building, deploying, and monitoring LLM-based pipelines and agents MS or PhD in Computer Science, Machine Learning, Statistics, or a related quantitative field, or equivalent practical experience PREFERRED QUALIFICATIONS PhD in Computer Science, Machine Learning, Statistics, or a related field Experience designing agent architectures that are configurable and extensible by practitioners who did not build them Hands-on experience building anomaly detection systems for evaluation quality, including drift detection, distribution analysis, and systematic bias identification Strong communication skills with the ability to influence technical direction across cross-functional teams Demonstrated passion for leveraging AI to improve work efficiency and scale

Requirements

  • 5+ years of industry experience in applied science or machine learning
  • Strong hands-on experience with Large Language Models
  • Strong working knowledge of evaluation methodology for generative AI
  • Familiarity with human-in-the-loop evaluation systems
  • Proficiency in Python and relevant ML frameworks
  • MS or PhD in Computer Science, Machine Learning, Statistics, or a related quantitative field

Benefits

  • 401k Matching
  • Certification Support
  • Flexible Hours
  • Gym Membership
  • Health Insurance
  • Home Office Budget
  • Learning Budget
  • Paid Time Off

Skills

Python
Machine Learning
Large Language Models
prompt engineering
Generative AI
Evaluation Methodology
Human-in-the-loop Evaluation

About AI-Estimated Salary

The salary range shown was not provided by the employer. Our AI has estimated it based on the job title, required experience, location, and industry standards (confidence: 80%). This estimate should be used as a general guide only and may not reflect the actual compensation. Always confirm salary details directly with the employer during the application process.

Ready to Apply?

Join Apple today

Salary Range (AI-Estimated)*
$120,000 - $180,000
80% confidence
Posted 10 hours ago

More AI Governance & Programs roles you might like

Discover similar opportunities from companies that are also hiring remotely.

Full Time
$120,000 - $180,000*
7 hours ago
United States
Worldwide
AI Governance & Programs
Senior
GDPR
AI Governance
Machine Learning
+3 more
Full Time
$120,000 - $180,000*
13 hours ago
Worldwide
AI Governance & Programs
Senior
Data governance
Data Architecture
Cloud Computing
+2 more
Full Time
$80,000 - $150,000*
15 hours ago
Worldwide
AI Governance & Programs
Mid
Python
C++
probability theory
+5 more

Explore more remote openings

Browse fresh listings from our global community of remote-friendly teams.

Full Time
$120,000 - $180,000*
9 hours ago
United States
Worldwide
AI Security & Privacy
Senior
AI
Machine Learning
Security
+4 more
Full Time
$171k - $230.534k
12 hours ago
Worldwide
AI Security & Privacy
Senior
AI
Machine Learning
Security
+5 more
Full Time
$120,000 - $180,000*
23 hours ago
Worldwide
AI Governance & Programs
Senior
Python
LLMs
Evaluation Frameworks
+2 more
Full Time
$180k - $200k
1 day ago
United States
Worldwide
AI Governance & Programs
Senior
Asset Liability Management
Model Risk Management
Stress Testing
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
Python
PyTorch
TensorFlow
+5 more
Full Time
$163.2k - $280.5k
1 day ago
Worldwide
AI Security & Privacy
Lead
API
AI/ML
Security
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
Go
Python
Java
+3 more
Full Time
$69k - $170k
1 day ago
United States
Worldwide
AI Governance & Programs
Senior
Python
Machine Learning
Model Risk Management
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Data Science
+2 more
Full Time
$130.5k - $145k
2 days ago
United States
Worldwide
AI Governance & Programs
Senior
AI Policy
AI frameworks
AI Development
+4 more
Full Time
$139.764k - $287.749k
2 days ago
Worldwide
AI Governance & Programs
Senior
Python
Machine Learning
Generative AI
+4 more
Full Time
$120,000 - $180,000*
2 days ago
United States
Worldwide
AI Security & Privacy
Senior
AI/ML systems
Cloud Security
Threat Detection
+4 more
Full Time
$147.25k - $215k
2 days ago
Worldwide
Model Risk Management & Validation
Senior
probability theory
stochastic processes
statistics
+5 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Data Protection
AI Governance
Compliance
+1 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Data Protection
AI Governance
Compliance
+1 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Data Protection
AI Governance
Compliance
+1 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Data Protection
AI Governance
Compliance
+1 more
Full Time
$54.4k - $120.75k
2 days ago
Worldwide
AI Governance & Programs
Mid
Risk Management Frameworks
Model Risk
Transparency
+5 more
Full Time
$80,000 - $140,000*
2 days ago
United States
Worldwide
AI Governance & Programs
Mid
Python
Excel
Data Analysis
+3 more
Full Time
$136k - $197k
2 days ago
Worldwide
AI Compliance & Legal
Senior
API
Compliance
Risk Management
+5 more
Full Time
$189.721k - $332.012k
3 days ago
Worldwide
AI Governance & Programs
Senior
Python
Machine Learning
AI
+5 more
Full Time
$120,000 - $180,000*
3 days ago
Worldwide
AI Security & Privacy
Senior
AI Security
Machine Learning
Python
+4 more
Full Time
$120,000 - $180,000*
3 days ago
Worldwide
AI Security & Privacy
Senior
Software Development
Testing
Artificial Intelligence
+5 more
Full Time
$120,000 - $180,000*
3 days ago
Worldwide
Model Risk Management & Validation
Senior
Credit Risk Models
Stress Testing
Model Performance
+4 more
Contract
$120,000 - $180,000*
3 days ago
Worldwide
AI Governance & Programs
Senior
Python
Data Analysis
Model Validation
+4 more
Full Time
$120,000 - $180,000*
3 days ago
United States
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Algorithmic tools
+2 more
Full Time
$120,000 - $180,000*
3 days ago
Worldwide
AI Governance & Programs
Senior
Data governance
AI Policy
Regulatory Compliance
+4 more
Full Time
$120,000 - $180,000*
3 days ago
Worldwide
AI Security & Privacy
Senior
Data classification
Governance Frameworks
DLP tools
+3 more
Full Time
$120,000 - $180,000*
5 days ago
Worldwide
AI Compliance & Legal
Executive
Data Protection
Artificial Intelligence
Compliance
+4 more
Full Time
$120,000 - $180,000*
6 days ago
India
Worldwide
AI Governance & Programs
Senior
Java
AWS
SQL
+5 more
Full Time
$169.1k - $270.8k
6 days ago
Worldwide
AI Governance & Programs
Staff
Machine Learning
Generative AI
Python
+1 more
Contract
$0.01k - $0.014k
6 days ago
Worldwide
AI Governance & Programs
Entry
Large Language Models
Problem-solving
Language analysis
+3 more
Contract
$0.01k - $0.014k
6 days ago
Worldwide
AI Governance & Programs
Entry
Italian
Large Language Models
Structured Guidelines
+3 more
Contract
$0.01k - $0.014k
6 days ago
Worldwide
AI Governance & Programs
Entry
Italian
Large Language Models
Structured Guidelines
+3 more
Full Time
$150,000 - $250,000*
6 days ago
Worldwide
AI Security & Privacy
Senior
Cloud Security
AI/ML
IAM
+5 more
Contract
$0.01k - $0.014k
6 days ago
Worldwide
AI Governance & Programs
Entry
Italian
Large Language Models
Problem-solving
+2 more