Job Description
Location: Remote (US & Western Europe – France, Germany, Switzerland, Denmark, Finland, Netherlands, Sweden, Iceland, Italy, Austria, Ireland, Norway)
Candidate Eligibility: Only Tier 1 colleges and Tier 1 companies (list attached) – others will be considered for separate projects.
Project Overview
We are building cutting-edge datasets to train, benchmark, and advance large language models for enterprise AI-driven coding solutions. As a Software Engineering Evaluator, you will:
- Curate code examples, provide precise solutions, and correct code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
- Evaluate and refine AI-generated code for efficiency, scalability, and reliability.
- Collaborate with cross-functional teams to enhance enterprise-level AI coding solutions.
- Design verification mechanisms and build agents to automatically validate solutions and identify error patterns.
- Hypothesize on steps in the software engineering cycle (prototyping, architecture, API design, implementation, launch, monitoring, operational maintenance) and evaluate model capabilities on them.
Typical Day
- Curate and correct high-quality code examples across multiple languages.
- Analyze AI-generated code for accuracy, performance, and maintainability.
- Provide structured evaluation rationales and communicate insights clearly.
- Build and validate automated verification systems for software tasks.
Required Skills & Experience
- 5+ years of software engineering experience, including 2+ years at a top-tier product company (Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research, etc.).
- Strong expertise in full-stack application development and deployment of production-grade software.
- Deep knowledge of software architecture, design patterns, debugging, and code quality assessment.
- Excellent oral and written communication skills for clear, structured evaluations.
- Proficiency in Python, JavaScript/ReactJS, C/C++, Java, Rust, and Go.
Vetting & Selection Process
- Interest Form – Candidates must confirm readiness for assessment.
- AI Interview – Quick 20-minute session with our AI interviewing platform QODE.
- Automated Coding Challenge – 30–45 minutes evaluating problem-solving and coding ability.
Target Companies & Universities
- Top Companies: Google, Apple, Amazon, Meta, Netflix, Microsoft, Tesla, NVIDIA, Adobe, Salesforce, Github, Atlassian, Databricks, Snowflake, Cloudflare, DigitalOcean, MongoDB, Elastic, Confluent, Airbnb, Dropbox, Stripe, Palantir, Uber, Lyft, Square (Block), Twilio, Snap Inc., Pinterest, Figma, Oracle, Cisco, Paypal, Doordash, Rivian, Reddit, Coinbase, Splunk, Spotify, Goldman Sachs, Morgan Stanley, JP Morgan Chase, Capital One, Plaid, Shopify, Intuit, Workday, ServiceNow, Hugging Face, VMware, Brex, Wise, Epic Games, Unity Technologies, Activision Blizzard, Riot Games, Valve, Huawei, Bloomberg, ByteDance, Alibaba, Baidu, Notion, Klarna, Instacart, Zillow.