Remote Opportunity

AI Evaluation Lead

Join Financial Conduct Authority as a lead professional working remotely from United Kingdom. Explore the role, benefits, and apply in one place.

Full Time
GBP 72.1k - GBP 110k
2 weeks ago
United Kingdom
Worldwide
AI Governance & Programs
Lead
GenAI
Machine Learning
Data Science
+4 more

Job Description

Job Title: AI Evaluation Lead Division: Data, Technology and Innovation Department: AI Product Delivery Salary: National (Edinburgh and Leeds) ranging from £72,100 to £100,000 and London from £79,300 to £110,000 per annum (salary offered will be based on skills and experience) This role is graded as: Technical Specialist – Regulatory Your external recruitment contact is Benjamin via [email protected]. Your internal recruitment contact is Lauren via [email protected] Applications must be submitted through our online portal. Applications sent via social media or email will not be accepted. About the FCA and team We regulate financial services firms in the UK, to keep financial markets fair, thriving and effective. By joining us, you’ll play a key part in protecting consumers, driving economic growth, and shaping the future of UK finance services. The Data, Technology and Innovation (DTI) division enables the FCA to be a digital-first, data-led smart regulator by delivering a secure, agile, and cost-effective technology and data ecosystem that drives better decisions, transparency, and operational efficiency. Working alongside the wider AI Programme (which will continue to oversee/coordinate AI activity across the FCA), the department will partner with business leads to shape and deliver work in priority areas — Authorisations, SPC, EMO and Anti‑Money Laundering. Role responsibilities Define and own evaluation frameworks for GenAI outputs covering quality measures such as accuracy relevance robustness and hallucination rates Design curate and govern test datasets and benchmarks to ensure consistent model and solution performance assessment over time Support development of automated evaluation pipelines and operational reporting to embed assurance into delivery Identify assess and mitigate risks in model behaviour for example bias errors safety concerns and edge cases with clear escalation and control recommendations Manage delivery through well-defined work packages setting priorities operating standards and performance objectives including line management of Business Analysts Engage senior stakeholders to understand strategic priorities build a pipeline of scoped and prioritised projects and translate needs into analytics led solutions with clear business value Work in the public interest protecting 40 million UK consumers who rely on financial services and supporting long term economic growth from an industry contributing 12% of UK economic output Manage digital and data-led change by encouraging innovative experiments and working with senior stakeholders while empowering a diverse team to collaborate openly Skills required Minimum: Experience delivering analytics, data science and AI/ML initiatives, including defining success measures, evaluating model/product performance, and applying innovative problem-solving approaches Demonstrated experience leading people (line management, coaching, and performance reviews), capable of overseeing a portfolio of projects and adjusting delivery as priorities shift Effective stakeholder management skills, including working with senior colleagues to translate priorities into well-scoped, prioritised work with clear outcomes and measurable value Essential: Demonstrable experience designing and applying evaluation frameworks for GenAI ML solutions such as accuracy relevance robustness consistency and hallucination or error rates including defining clear acceptance thresholds Experience curating documenting and governing test datasets and benchmarks including version control to enable repeatable assessment and comparability over time Ability to identify assess and mitigate model risks including bias safety concerns data leakage harmful outputs and edge cases and to recommend appropriate controls and escalation routes Experience building or specifying automated evaluation and monitoring approaches such as scripted test runs scoring and dashboards or management information to embed assurance into delivery and ongoing operations Demonstrated analytical skills with the ability to interpret evaluation results communicate uncertainty and limitations translate complex technical concepts for senior stakeholders and make evidence-based recommendations to improve solution performance Consistent delivery discipline including planning and managing evaluation work packages prioritising across competing demands and ensuring outputs meet agreed standards and timelines Clear written documentation skills producing evaluation reports test plans and assurance artefacts that are audit ready and suitable for governance forums Benefits 28 days annual leave plus bank holidays Hybrid model where employees work a minimum of 40% in the office each month (expectation of 50% for senior leaders). Changing from September to a minimum of 50% in the office each month (expectation of 60% for Directors and Executive Directors) Non-contributory pension (8–12% depending on age) and life assurance at eight times your salary Private healthcare with Bupa, income protection, and 24/7 Employee Assistance 35 hours of paid volunteering annually A flexible benefits scheme designed around your lifestyle For a full list of our benefits, and our recruitment process as a whole visit our benefits page. Our values & culture Our colleagues are the key to our success as a regulator. We are committed to fostering a diverse and inclusive culture: one that’s free from discrimination and bias, celebrates difference, and supports colleagues to deliver at their best. We believe that our differences and similarities enable us to be a better organisation – one that makes better decisions, drives innovation, and delivers better regulation. If you require any adjustments due to a disability or condition, your recruiter is here to help - reach out for tailored support. We welcome diverse working styles and aim to find flexible solutions that suit both the role and individual needs, including options like part-time and job sharing where applicable. Disability Confident: our hiring approach We’re proud to be a Disability Confident Employer, and therefore, people or individuals with disabilities and long-term conditions who best meet the minimum criteria for a role will go through to the next stage of the recruitment process. In cases of high application volumes, we may progress applicants whose experience most closely matches the role’s key requirements. Useful information and timeline Advert Closing: 19th May CV Review/Shortlist: 22nd May First Stage Interviews W/C: 1st June Second Stage Interviews W/C: 8th June Your Recruiter will discuss the process in detail with you during screening for the role, therefore, please make them aware if you are going to be unavailable for any date during this time. At the FCA, we’re creating a fair and more resilient financial system. We’re establishing more transparent relationships between financial services and their customers, building trust in financial markets and protecting vulnerable consumers. Click here to learn more about the FCA.

Requirements

  • Define and own evaluation frameworks for GenAI outputs
  • Design curate and govern test datasets and benchmarks
  • Support development of automated evaluation pipelines
  • Manage delivery through well-defined work packages
  • Engage senior stakeholders to understand strategic priorities
  • Work in the public interest protecting 40 million UK consumers

Benefits

  • Flexible Hours
  • Gym Membership
  • Health Insurance
  • Home Office Budget
  • Learning Budget
  • Performance Bonus
  • Remote Work

Skills

GenAI
Machine Learning
Data Science
Python
API
Data Analysis
Model Evaluation

Ready to Apply?

Join Financial Conduct Authority today

Salary Range
GBP 72.1k - GBP 110k
Posted 2 weeks ago

More AI Governance & Programs roles you might like

Discover similar opportunities from companies that are also hiring remotely.

Full Time
$7048.161k - $1061.802k
21 hours ago
United States
Worldwide
AI Governance & Programs
Senior
Python
SQL
AI/ML
+4 more
Full Time
$135k - $150k
23 hours ago
Worldwide
AI Governance & Programs
Mid
Python
Machine Learning
LLM
+4 more
Full Time
$120,000 - $180,000*
1 day ago
Worldwide
AI Governance & Programs
Senior
Data governance
AI Policy
Risk Management
+5 more

Explore more remote openings

Browse fresh listings from our global community of remote-friendly teams.

Full Time
$120,000 - $180,000*
22 hours ago
Worldwide
Americas
AI Security & Privacy
Senior
AI
Machine Learning
Cyber Security
+3 more
Full Time
$150,000 - $250,000*
1 day ago
Worldwide
AI Security & Privacy
Senior
OWASP ZAP
Nmap
Postman
+5 more
Full Time
$150k - $200k
1 day ago
Worldwide
AI Governance & Programs
Mid
AI
Python
Clinical AI
+5 more
Full Time
$120,000 - $180,000*
1 day ago
Australia
Worldwide
AI Governance & Programs
Senior
Data governance
AI Ethics
Regulatory Compliance
+3 more
Full Time
$120,000 - $180,000*
1 day ago
Australia
Worldwide
AI Governance & Programs
Senior
Data governance
AI Ethics
Regulatory Compliance
+3 more
Full Time
$85k - $95k
1 day ago
United States
Model Risk Management & Validation
Senior
Model Risk Management
Quantitative Risk Management
Financial Modeling
+4 more
Full Time
$85k - $95k
1 day ago
United States
Model Risk Management & Validation
Senior
Model Risk Management
Quantitative Risk Management
Financial Modeling
+5 more
Full Time
$80,000 - $140,000*
1 day ago
United States
AI Risk & Controls
Mid
Excel
SQL
Python
+1 more
Full Time
$80,000 - $120,000*
1 day ago
United States
Model Risk Management & Validation
Mid
Excel
SQL
Python
+1 more
Full Time
$129k - $175k
2 days ago
Worldwide
AI Audit / Assurance / Controls Testing
Senior
API
Automation
Python
+3 more
Full Time
$129k - $175k
2 days ago
Worldwide
AI Audit / Assurance / Controls Testing
Senior
API
Automation
Python
+3 more
Full Time
$119.7k - $191.1k
2 days ago
Worldwide
AI Governance & Programs
Senior
Risk Management
Model Risk
Governance
+5 more
Full Time
$120,000 - $180,000*
2 days ago
Ireland
Worldwide
AI Compliance & Legal
Senior
Data Protection
AI Compliance
Regulatory Requirements
+3 more
Full Time
$100,000 - $150,000*
2 days ago
Worldwide
AI Governance & Programs
Mid
AI/ML Concepts
Tableau
JIRA
+1 more
Full Time
$204k - $255k
2 days ago
Worldwide
AI Policy, Enablement & Training
Senior
AI
Machine Learning
Policy Development
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Staff
Python
ISO 27001
ISO 27701
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Staff
Python
ISO 27001
ISO 27701
+4 more
Full Time
$120,000 - $180,000*
2 days ago
Worldwide
AI Security & Privacy
Staff
Python
Adversarial Machine Learning
AI Deployment Architectures
+4 more
Full Time
Up to PHP 150k
2 days ago
Worldwide
AI Security & Privacy
Senior
PyTorch
TensorFlow
Containerized Environments
+4 more
Full Time
Up to PHP 150k
2 days ago
Worldwide
AI Security & Privacy
Senior
PyTorch
TensorFlow
Gradient-based attacks
+4 more
Full Time
$209k - $309k
4 days ago
Worldwide
AI Security & Privacy
Senior
API
AI
Security
+1 more
Full Time
$239.5k - $351.5k
4 days ago
Worldwide
AI Security & Privacy
Senior
API
AI
Security
+1 more
Full Time
$230k - $280k
4 days ago
United States
Worldwide
AI Governance & Programs
Senior
OWASP
NIST AI RMF
AI/ML systems
+5 more
Full Time
$230k - $280k
4 days ago
Worldwide
AI Governance & Programs
Senior
Agentic Trust Framework
OWASP
NIST AI RMF
+5 more
Full Time
$120,000 - $180,000*
5 days ago
Worldwide
AI Security & Privacy
Senior
Python
Go
Git
+5 more
Full Time
$159.3k - $273.2k
5 days ago
Worldwide
AI Governance & Programs
Senior
Python
Machine Learning
Data Science
+5 more
Full Time
$120,000 - $180,000*
5 days ago
Worldwide
AI Security & Privacy
Staff
Python
Go
Threat modeling
+3 more
Full Time
$80,000 - $140,000*
5 days ago
Worldwide
AI Governance & Programs
Mid
Responsible AI
ISO/IEC 42001
ISO/IEC 27001
+2 more
Full Time
$120,000 - $180,000*
5 days ago
United States
Worldwide
AI Governance & Programs
Senior
AI Ethics
Risk Management
AI governance frameworks
+5 more
Full Time
$120,000 - $180,000*
6 days ago
Worldwide
AI Security & Privacy
Senior
Security Operations
Cybersecurity
NG-SIEM
+5 more
Full Time
$163k - $237k
6 days ago
Worldwide
AI Governance & Programs
Senior
API
Product Management
AI
+4 more
Full Time
$80,000 - $140,000*
6 days ago
United States
Worldwide
AI Governance & Programs
Mid
Python
Data Analysis
Financial Data
+3 more
Full Time
$80,000 - $140,000*
6 days ago
United States
Worldwide
AI Governance & Programs
Mid
Python
Data Analysis
Machine Learning
+2 more
Full Time
$80,000 - $140,000*
6 days ago
Worldwide
AI Governance & Programs
Mid
Python
Excel
Google Sheets
+4 more
Full Time
$120,000 - $180,000*
6 days ago
Australia
Worldwide
AI Governance & Programs
Senior
AI
Machine Learning
Data Science
+4 more
Full Time
$120,000 - $180,000*
6 days ago
Worldwide
AI Governance & Programs
Senior
AI Governance
Model Risk Management
Regulatory Compliance
+5 more
Full Time
$120,000 - $180,000*
1 weeks ago
Worldwide
AI Governance & Programs
Senior
Python
ML frameworks
LLM/GenAI tooling
+2 more