Remote Opportunity

Data Scientist - AI Evaluation

Join Wizard as a senior professional working remotely from anywhere in the world. Explore the role and benefits, and apply in one place.

Full Time
$225k - $280k
4 months ago
Worldwide
AI Governance & Programs
Senior
Python
Data Science
Machine Learning

Job Description

About Wizard

Wizard is the top-performing AI Shopping Agent, delivering the best products from across the web with unmatched accuracy, quality, and trust.

The Role

We’re looking for a Data Scientist to own how we measure, understand and improve the accuracy of our AI agent. This role sits at the intersection of data science, machine learning and product, and focuses on evaluation, experimentation and insight generation. You won’t be building models, but you will make sure they work in real-world scenarios. You will build the systems that measure what good looks like and partner closely with ML, AI Engineering and Product to continuously improve the agent’s performance.

What You’ll Do

  • Define and evolve accuracy metrics across the full shopping experience (retrieval, ranking, recommendations and outcomes)
  • Design and run experiments to measure improvements and regressions
  • Build and maintain evaluation datasets, benchmarks and scoring frameworks
  • Translate ambiguous product questions into clear, measurable hypotheses and analysis
  • Partner with ML Engineers to validate model changes and guide iteration
  • Identify failure modes and edge cases and drive improvements through data
  • Create dashboards and reporting that make agent performance visible, trusted and actionable

What Success Looks Like

  • Clear, trusted accuracy metrics are consistently used across product and engineering
  • A robust automated evaluation framework exists for both offline and live experiments
  • Model and product changes are consistently measured before and after launch 

Ideal Background

  • 4-6+ years in Data Science, ML Evaluation, Applied AI or similar roles
  • Deep experience evaluating AI/ML systems (ranking, recommendations, LLMs, etc.)
  • Strong experience with experimentation (A/B testing, causal inference)
  • Experience working on consumer products or user-facing systems, with exposure to marketplace or e-commerce systems
  • Ability to translate messy problems into structured analysis and metrics
  • Strong product mindset: you care about real user outcomes
  • Clear communication with the ability to influence across engineering and product

Compensation & Benefits

The expected base salary range for this role is $225,000 - $280,000 USD, and will vary based on skills, experience, role level, and geographic location. Final compensation will be determined by considering these factors alongside overall role scope and responsibilities.

In addition to base salary, Wizard offers:

  • Equity in the form of stock options
  • Medical, dental, and vision coverage
  • 401(k) plan
  • Flexible PTO and company holidays
  • Fully remote work within the United States
  • Periodic company offsites and team gatherings

Wizard is committed to fair, transparent, and competitive compensation practices.

Benefits

  • 401k Matching
  • Equity
  • Health Insurance
  • Remote Work
  • Stock Options
  • Training Budget

Skills

Python
Data Science
Machine Learning
AI
Experimentation
A/B Testing
Causal Inference
