Remote Opportunity

Staff Machine Learning Platform Engineer, AI Evaluation

Join Apple as a staff professional working remotely from Worldwide. Explore the role, benefits, and apply in one place.

Full Time
$120,000 - $200,000*
1 day ago
Worldwide
AI Governance & Programs
Staff
Python
Pandas
API design
+5 more

Job Description

Join Apple Services Engineering to build the next generation of AI evaluation systems. We are seeking a staff machine learning platform engineer to lead the architectural design and development of the high availability services and internal tools powering self-service evaluation at scale. You will partner with researchers to operationalize their innovations, transforming complex workflows into intuitive, developer-first platforms. We are looking for builders who thrive in the ambiguity of new initiatives and are passionate about creating scalable infrastructure. DESCRIPTION You will join the engineering team responsible for democratizing AI evaluation across the organization. Your focus will be on developing the developer experience—architecting and implementing the APIs, SDKs, and platform services that turn complex evaluation metrics into simple, self-service calls. You will work hand-in-hand with researchers to operationalize sophisticated measurement techniques, ensuring they scale reliably within our high-availability infrastructure. In this role, you will drive the engineering standards for a new organization, upholding the code quality, automation, and testing rigor required to support the rapid evolution of Generative AI and Agentic systems. MINIMUM QUALIFICATIONS 8+ years of hands-on software engineering experience, with a track record of owning the technical direction of a platform or infrastructure domain. Strong proficiency in the Python ecosystem (e.g., FastAPI, Pydantic, Pandas). You write production-grade code and lead architectural discussions on day one. Customer Obsession & Product Thinking: You have owned the technical roadmap for an internal platform, presented it to senior stakeholders, and shipped against it. You independently translate vague requirements from other teams into concrete engineering specifications and platform roadmaps. Demonstrated experience leading technical partnerships with Data Scientists or Researchers: You have taken research code and shipped it as a production service and built the abstractions, testing frameworks, and deployment pipelines that made the next handoff faster than the last.. Strong expertise in API Design & Platform Infrastructure: You have designed and owned APIs and SDKs that other developers rely on, with a focus on versioning, backward compatibility, and developer experience at scale. Operational excellence background: You have architected and owned CI/CD pipelines, containerization (Docker/Kubernetes), and monitoring (Datadog/Prometheus) for production services, and have been accountable for their reliability. Bachelors in Computer Science or related field, Masters preferred. PREFERRED QUALIFICATIONS Deep familiarity with AI Evaluation Frameworks: You have built, extended, or contributed to modern evaluation tools like DeepEval, Ragas, TruLens, or LangSmith. You understand how to implement and scale model-based evaluation workflows across a large organization. Evaluation Service Deployment: Own the deployment, scaling, and operational health of evaluation services in production - including high-throughput evaluation job orchestration (queueing, prioritization, concurrency, auto-scaling), and defining SLAs for evaluation pipeline latency and availability. Observability & Reliability: Experience instrumenting production ML evaluation pipelines including tracking evaluation job throughput, queue depth, judge model latency SLAs, scoring drift over time, and failure modes specific to non-deterministic LLM-based evaluation workflows. Deep understanding of Generative AI & Agents: You understand the engineering challenges of relying on LLMs and Agents as software components—specifically managing token economics, handling rate limits, and evaluating non-deterministic, multi-step reasoning capabilities. You have built production systems that depend on these components and have solved these problems at scale. Builder Experience: You have thrived in startup-like environments, navigating high ambiguity to deliver complex technical roadmaps from scratch.

Requirements

  • 8+ years of hands-on software engineering experience
  • Strong proficiency in the Python ecosystem
  • Customer Obsession & Product Thinking
  • Demonstrated experience leading technical partnerships with Data Scientists or Researchers
  • Strong expertise in API Design & Platform Infrastructure
  • Operational excellence background
  • Bachelors in Computer Science or related field, Masters preferred

Benefits

  • 401k Matching
  • Certification Support
  • Company Retreats
  • Conference Budget
  • Coworking Space
  • Dental Insurance
  • Disability Insurance
  • Employee Assistance Program

Skills

Python
Pandas
API design
Docker
Kubernetes
FastAPI
Pydantic
Platform Infrastructure

About AI-Estimated Salary

The salary range shown was not provided by the employer. Our AI has estimated it based on the job title, required experience, location, and industry standards (confidence: 80%). This estimate should be used as a general guide only and may not reflect the actual compensation. Always confirm salary details directly with the employer during the application process.

Ready to Apply?

Join Apple today

Salary Range (AI-Estimated)*
$120,000 - $200,000
80% confidence
Posted 1 day ago

More AI Governance & Programs roles you might like

Discover similar opportunities from companies that are also hiring remotely.

Full Time
$102k - $130k
14 hours ago
Worldwide
AI Governance & Programs
Mid
Process Management
Stakeholder management
Governance Frameworks
+5 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
AWS
Cloud Security
Security Engineering
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
MLOps tools
SQL
Python
+4 more

Explore more remote openings

Browse fresh listings from our global community of remote-friendly teams.

Full Time
$186.9k - $220.4k
12 hours ago
Worldwide
AI Security & Privacy
Staff
API
Cloud
Encryption
+5 more
Full Time
$170k - $200k
1 day ago
Worldwide
AI Security & Privacy
Senior
Cloud Security
Application Security
AWS
+5 more
Full Time
$70k - $80k
1 day ago
Worldwide
AI Security & Privacy
Entry
Python
AWS
Azure
+5 more
Full Time
$130k - $145k
1 day ago
Worldwide
AI Security & Privacy
Mid
AWS
Azure
GCP
+5 more
Full Time
$52k - $61.6k
1 day ago
United States
Worldwide
Model Risk Management & Validation
Mid
Model Risk
Risk Management
Program Management
+3 more
Full Time
$120,000 - $180,000*
1 day ago
United States
Worldwide
AI Security & Privacy
Mid
AI/ML
Security
Threat modeling
+5 more
Full Time
$104k - $171.5k
1 day ago
United States
Worldwide
Model Risk Management & Validation
Senior
Model Inventory
Quantitative Risk Management
Risk and Control Frameworks
+3 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
AI
Security
Engineering
Full Time
$100,000 - $150,000*
2 days ago
Worldwide
AI Governance & Programs
Mid
probability theory
stochastic processes
statistics
+5 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
Enterprise Security Architecture
+5 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
Enterprise Security Architecture
+5 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
Enterprise Security Architecture
+5 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Vector DBs
Fine-tuning Pipelines
Python
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
AI Deployment Architectures
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
AI Deployment Architectures
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
AI Deployment Architectures
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
AI Deployment Architectures
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
AI Deployment Architectures
+4 more
Full Time
$100,000 - $150,000*
2 days ago
Netherlands
AI Governance & Programs
Senior
AI
Machine Learning
Risk Management
+2 more
Full Time
$100,000 - $150,000*
2 days ago
Netherlands
AI Governance & Programs
Senior
AI
Machine Learning
Risk Management
+2 more
Full Time
$80,000 - $140,000*
2 days ago
Worldwide
AI Governance & Programs
Mid
Data governance
Project Management
Data & AI Policy
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Senior
AI Security
Machine Learning
Cloud Security
+3 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Quantum Physics
Quantum Optics
Photonics
+4 more
Full Time
$156.8k - $285.6k
2 days ago
United States
Canada
Worldwide
AI Security & Privacy
Senior
API Experience
Backend Engineering
Machine Learning
+3 more
Full Time
CAD 120k - CAD 153.8k
2 days ago
Canada
Worldwide
AI Risk & Controls
Executive
Model Risk Management
Data Management
Stress Testing
+4 more
Full Time
$0.03k - $0.035k
2 days ago
Worldwide
AI Governance & Programs
Senior
AI
Data Annotation
Linguistics
+4 more
Full Time
$0.03k - $0.035k
2 days ago
Worldwide
AI Governance & Programs
Senior
French
AI
Machine Learning
+2 more
Full Time
$0.03k - $0.035k
2 days ago
Worldwide
AI Governance & Programs
Senior
German
AI
Data Annotation
+4 more
Full Time
$0.03k - $0.035k
2 days ago
Worldwide
AI Governance & Programs
Senior
English
AI
Machine Learning
+3 more
Full Time
$0.03k - $0.035k
2 days ago
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Data Annotation
+2 more
Full Time
$0.03k - $0.035k
2 days ago
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Data Annotation
+2 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Python
LLMs
Evaluation Frameworks
+2 more
Full Time
$128k - $200k
5 days ago
Worldwide
AI Security & Privacy
Staff
Secrets Management
Security Architecture
Threat modeling
+5 more
Full Time
$150k - $185k
5 days ago
Worldwide
AI Security & Privacy
Senior
Azure
Threat modeling
Networking
+5 more
Full Time
$80,000 - $140,000*
5 days ago
United Kingdom
Worldwide
AI Governance & Programs
Lead
Python
SQL
SAS
+3 more
Full Time
$80,000 - $140,000*
5 days ago
United Kingdom
Worldwide
AI Governance & Programs
Lead
Python
SQL
SAS
+3 more