Remote Opportunity

Data Scientist - AI Evaluation

Join Wizard as a senior professional working remotely from Worldwide. Explore the role, benefits, and apply in one place.

Full Time
$225k - $280k
18 hours ago
Worldwide
AI Governance & Programs
Senior
Python
Data Science
Machine Learning
+4 more

Job Description

About Wizard

Wizard is the top-performing AI Shopping Agent, delivering the best products from across the web with unmatched accuracy, quality, and trust.

The Role

We’re looking for a Data Scientist to own how we measure, understand and improve the accuracy of our AI agent. This role sits at the intersection of data science, machine learning and product and is focused on evaluation, experimentation and insight generation.  You won’t be building models but you will make sure they work in real world scenarios.  You will build the systems to measure what good looks like and partner closely with ML, AI Engineering and Product to continuously improve the agent’s performance.

What You’ll Do

  • Define and evolve accuracy metrics across the full shopping experience (retrieval, ranking, recommendations and outcomes)
  • Design and run experiments to measure improvements and regressions
  • Build and maintain evaluation datasets, benchmarks and scoring frameworks
  • Translate ambiguous product questions into clear, measurable hypotheses and analysis
  • Partner with ML Engineers to validate model changes and guide iteration
  • Identify failure modes and edge cases and drive improvements through data
  • Create dashboards and reporting that make agent performance visible, trusted and actionable

What Success Looks like

  • Clear, trusted accuracy metrics are consistently used across product and engineering
  • A robust automated evaluation framework exists for both offline and live experiments
  • Model and product changes are consistently measured before and after launch 

Ideal Background

  • 4-6+ years in Data Science, ML Evaluation or Applied AI or similar roles
  • Deep experience evaluating AI/ML systems (ranking, recommendations, LLMs, etc)
  • Strong experience with experimentation (A/B testing, causal inference)
  • Experience working on consumer products or user facing systems and exposure to marketplace or e-commerce systems
  • Ability to translate messy problems into structured analysis and metrics
  • Strong product mindset, you care about real user outcomes
  • Clear communication with the ability to influence across engineering and product

Compensation & Benefits

The expected base salary range for this role is $225,000 - $280,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.

In addition to base salary, Wizard offers:

  • Equity in the form of stock options
  • Medical, dental, and vision coverage
  • 401(k) plan
  • Flexible PTO and company holidays
  • Fully remote work within the United States
  • Periodic company offsites and team gatherings

Wizard is committed to fair, transparent, and competitive compensation practices.

Requirements

  • 4-6+ years in Data Science, ML Evaluation or Applied AI or similar roles
  • Deep experience evaluating AI/ML systems (ranking, recommendations, LLMs, etc)
  • Strong experience with experimentation (A/B testing, causal inference)
  • Experience working on consumer products or user facing systems and exposure to marketplace or e-commerce systems
  • Ability to translate messy problems into structured analysis and metrics
  • Strong product mindset, you care about real user outcomes
  • Clear communication with the ability to influence across engineering and product

Benefits

  • 401k Matching
  • Equity
  • Health Insurance
  • Remote Work
  • Stock Options
  • Training Budget

Skills

Python
Data Science
Machine Learning
AI
Experimentation
A/B Testing
Causal Inference

Ready to Apply?

Join Wizard today

Salary Range
$225k - $280k
Posted 18 hours ago

More AI Governance & Programs roles you might like

Discover similar opportunities from companies that are also hiring remotely.

Full Time
$9149.346k - $1372.402k
15 hours ago
Worldwide
AI Governance & Programs
Senior
Python
Microsoft Entra ID
Azure AD
+5 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Generative AI
+5 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
Copilot Studio
Power Automate
Power Apps
+4 more

Explore more remote openings

Browse fresh listings from our global community of remote-friendly teams.

Full Time
$120,000 - $180,000*
8 hours ago
United Kingdom
Worldwide
Model Risk Management & Validation
Senior
Solvency II
Internal Model
Model Risk
+4 more
Full Time
$120,000 - $180,000*
18 hours ago
Worldwide
AI Security & Privacy
Senior
AI/ML
Security Architecture
Cloud Security
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Canada
Worldwide
Model Risk Management & Validation
Senior
Python
SAS
SQL
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Ireland
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Data Science
+4 more
Full Time
$120,000 - $180,000*
1 day ago
United States
Worldwide
AI Governance & Programs
Senior
Model Risk Management
AI Governance
Compliance
+4 more
Full Time
CAD 94.6k - CAD 176k
1 day ago
Canada
Worldwide
AI Risk & Controls
Senior
Finance
Model Validation
Risk Management
+3 more
Full Time
$106.23k - $145k
1 day ago
Worldwide
AI Governance & Programs
Senior
AI/ML
Cloud Security
Data Science
+5 more
Full Time
$148k - $274.2k
1 day ago
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Data Science
+3 more
Full Time
$228.911k - $471.286k
1 day ago
Worldwide
AI Compliance & Legal
Senior
AI
Machine Learning
Data
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
Enterprise Security Architecture
+5 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Security & Privacy
Senior
Python
Adversarial Machine Learning
Enterprise Security Architecture
+5 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
Python
Node.JS
Go
+3 more
Full Time
$147k - $211k
1 day ago
Worldwide
AI Governance & Programs
Senior
C++
API design
Stubby
+3 more
Full Time
$80,000 - $140,000*
1 day ago
Worldwide
AI Risk & Controls
Mid
Data Analysis
Risk Assessment
Automation
+2 more
Full Time
$120,000 - $200,000*
2 days ago
Worldwide
AI Risk & Controls
Senior
Python
Django
Kubernetes
+5 more
Full Time
$98.16k - $159.27k
2 days ago
United States
AI Security & Privacy
Senior
Azure Security Engineer
Microsoft Cybersecurity Architect
CISSP
+4 more
Full Time
$108.75k - $200k
2 days ago
United States
Model Risk Management & Validation
Senior
Quantitative models
Risk Management
Model Risk Management
+4 more
Full Time
$108k - $185k
2 days ago
United States
Worldwide
AI Compliance & Legal
Senior
Artificial Intelligence
Generative AI
Machine Learning
+3 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Generative AI
LLMs
prompt engineering
+5 more
Full Time
$229.9k - $262.4k
2 days ago
United States
Worldwide
AI Governance & Programs
Senior
Python
Machine Learning
Cloud Computing
+3 more
Full Time
$120,000 - $180,000*
2 days ago
Canada
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Risk Management
+3 more
Full Time
$80,000 - $140,000*
2 days ago
Germany
Worldwide
Model Risk Management & Validation
Mid
Model Validation
Risk Management
Model Development
+5 more
Full Time
$100,000 - $150,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
AI
Data governance
Regulatory Compliance
+5 more
Full Time
$120,000 - $180,000*
2 days ago
United Kingdom
Worldwide
AI Governance & Programs
Senior
Cloud
AI
API
+3 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Python
Machine Learning
Data Analysis
+4 more
Full Time
$108k - $185k
2 days ago
United States
Worldwide
AI Compliance & Legal
Senior
Artificial Intelligence
Generative AI
Machine Learning
+3 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Governance & Programs
Senior
Data security & privacy
Access control and permissions
API integrations
+5 more
Full Time
$120,000 - $180,000*
3 days ago
Worldwide
AI Governance & Programs
Senior
AWS
Azure
GCP
+5 more
Contract
$120,000 - $180,000*
4 days ago
Worldwide
AI Governance & Programs
Senior
ISO/IEC 42001
AI Lifecycle Governance
Data governance
+4 more
Contract
$120,000 - $180,000*
4 days ago
Worldwide
AI Governance & Programs
Senior
ISO/IEC 42001
Audit Methodology
Artificial Intelligence Management Systems
+1 more
Full Time
$102.8k - $210.2k
4 days ago
Worldwide
AI Compliance & Legal
Senior
Data Analysis
Business Analysis
Data Management
+5 more
Full Time
$197.4k - $246.75k
4 days ago
Worldwide
AI Governance & Programs
Senior
Machine Learning
Generative AI
Python
+5 more
Full Time
$149.084k - $218.657k
4 days ago
Worldwide
AI Security & Privacy
Senior
AI/ML systems
Governance
Model Risk Management
+5 more
Full Time
$80,000 - $160,000*
4 days ago
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Data Science
+5 more
Full Time
$80,000 - $160,000*
4 days ago
Worldwide
AI Governance & Programs
Senior
Pharmacy
Retail Operations
Customer Service
+5 more