Remote Opportunity

Senior Software Engineer – LLM Evaluation

Join Careerflow.ai as a senior professional working remotely from anywhere in the world. Explore the role and benefits, and apply in one place.

Full Time
$0.3k+
4 months ago
Worldwide
AI Governance & Programs
Senior
Python
JavaScript
C/C++

Job Description

Project Overview

As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go; evaluating and refining AI-generated code for efficiency, scalability, and reliability; and working with cross-functional teams to enhance enterprise-level AI-driven coding solutions.

What Does a Typical Day Look Like?

  • Work on AI model training initiatives by curating code examples, building solutions, and correcting code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.

  • Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable.

  • Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks.

  • Build agents that can verify the quality of the code and identify error patterns.

  • Hypothesize about steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them.

  • Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills

  • 5+ years of software engineering experience, including 2+ years of continuous full-time experience at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).

  • Strong expertise in building full-stack applications and deploying scalable, production-grade software using modern languages and tools.

  • Deep understanding of software architecture, design, development, debugging, and code quality/review assessment.

  • Excellent oral and written communication skills for clear, structured evaluation rationales.

Vetting process:

  1. The candidate must apply using this link

  2. Next, candidates complete a quick 20-minute AI interview on our AI interviewing platform, QODE.

  3. After that, they take the Vetsmith, which includes 1 automated coding challenge (this takes 30 to 45 minutes).

Top companies:

Google (Alphabet), Apple, Amazon, Meta (Facebook), Netflix, Microsoft, Tesla, NVIDIA, Adobe, Salesforce, GitHub, Atlassian, HashiCorp, Databricks, Snowflake, Cloudflare, DigitalOcean, MongoDB, Elastic, Confluent, Airbnb, Dropbox, Stripe, Palantir, Uber, Lyft, Square (Block), Twilio, Snap Inc., Pinterest, Figma, Oracle, Cisco, PayPal, DoorDash, Rivian, Reddit, Coinbase, Splunk, Spotify, Goldman Sachs, Morgan Stanley, JPMorgan Chase, Capital One, Plaid, Shopify, Intuit, Workday, ServiceNow, Hugging Face, VMware, Brex, Wise, Epic Games, Unity Technologies, Activision Blizzard, Riot Games, Valve, Huawei, Bloomberg, ByteDance, Alibaba, Baidu, Notion, Klarna, Instacart, Zillow.

Requirements

  • 5+ years of software engineering experience
  • 2+ years of continuous full-time experience at a top-tier product company
  • Strong expertise in building full-stack applications
  • Deep understanding of software architecture, design, development, debugging, and code quality/review assessment
  • Excellent oral and written communication skills for clear, structured evaluation rationales
  • Ability to work with cross-functional teams
  • Experience with AI model training initiatives
  • Ability to evaluate and refine AI-generated code

Benefits

  • 401k Matching
  • Certification Support
  • Flexible Hours
  • Health Insurance
  • Home Office Budget
  • Learning Budget
  • Paid Time Off
  • Remote Work

Skills

Python
JavaScript
C/C++
Java
Go
ReactJS
Rust
AI-generated code
