Job Description
Location
United States & Western Europe
(France, Germany, Switzerland, Denmark, Finland, Netherlands, Sweden, Iceland, Italy, Austria, Ireland, Norway)
Eligibility
This opportunity is strictly for candidates from Tier-1 universities or Tier-1 technology companies.
If you do not meet these criteria, please ignore this listing — we have other projects available.
Project Overview
We are hiring experienced Software Engineers to work as Engineering Evaluators for advanced AI systems.
You will collaborate with researchers and AI teams to create high-quality datasets used to train and benchmark next-generation large language models. Your expertise will help improve AI-generated code quality across multiple programming languages.
This role involves analyzing, correcting, and validating code across modern software stacks while ensuring production-level standards for efficiency, scalability, and reliability.
Technologies
Engineers should be highly proficient in one or more of the following:
- Python
- JavaScript (including ReactJS)
- C / C++
- Java
- Rust
- Go
What You’ll Do
AI Training & Code Evaluation
- Create high-quality code examples used to train AI models
- Build solutions and correct code across multiple programming languages
- Evaluate AI-generated code for correctness and performance
Engineering Quality Review
- Improve code quality, scalability, and reliability
- Identify edge cases, bugs, and architectural improvements
- Provide structured technical feedback
AI System Collaboration
- Work with cross-functional research and engineering teams
- Help improve AI-driven development tools
- Benchmark AI systems against real-world engineering standards
Automation & Verification
- Build agents that verify code quality automatically
- Identify recurring error patterns in generated code
- Design verification frameworks for software engineering tasks
Software Development Lifecycle Evaluation
Assess AI capabilities across:
- Prototyping
- Architecture design
- API design
- Production implementation
- Launch and experimentation
- Monitoring and operational maintenance
Required Experience
- 5+ years of software engineering experience
- 2+ years at a top-tier product company
- Strong experience building production-grade systems
- Deep understanding of:
- Software architecture
- System design
- Debugging and optimization
- Code reviews and engineering standards
- Excellent written and verbal communication skills
Preferred Background
Candidates from companies such as:
Google, Apple, Amazon, Meta, Netflix, Microsoft, Tesla, NVIDIA, Adobe, Salesforce, GitHub, Atlassian, Databricks, Snowflake, Cloudflare, MongoDB, Stripe, Palantir, Uber, Shopify, Airbnb, Coinbase, Spotify, Bloomberg, ByteDance, Alibaba, and other leading technology organizations.
Hiring Process
Automated coding challenge (30–45 minutes)
Submit application and complete interest form
20-minute AI interview via the QODE platform