Job Description
Project Overview
As a Software Engineering Evaluator, you will:
- Curate code examples, provide precise solutions, and correct code in Python, JavaScript (including ReactJS), C/C++, Java, Rust, and Go.
- Evaluate and refine AI-generated code for efficiency, scalability, reliability, and maintainability.
- Collaborate with cross-functional teams to enhance enterprise-level AI-driven coding solutions.
- Build agents and verification mechanisms to automatically validate software engineering tasks.
- Analyze each stage of the software engineering lifecycle (prototyping, architecture, API design, production deployment, monitoring) to assess AI model capabilities.
Daily Responsibilities
- Work on AI model training initiatives by curating examples and correcting code.
- Review and optimize AI-generated code for performance, security, and best practices.
- Collaborate with researchers and engineers to design robust, industry-standard benchmarks.
- Identify error patterns in code and propose improvements.
- Provide structured, clear, and well-justified evaluation rationales.
Required Skills & Experience
- 5+ years of software engineering experience, including 2+ years at a top-tier product company (e.g., Google, Stripe, Amazon, Apple, Meta, Netflix, Microsoft, Datadog, Dropbox, Shopify, PayPal, IBM Research).
- Strong expertise in full-stack development and deploying scalable, production-grade applications.
- Deep understanding of software architecture, design patterns, debugging, and code quality assessment.
- Experience across multiple programming languages: Python, JavaScript/ReactJS, C/C++, Java, Rust, Go.
- Excellent communication skills for articulating evaluation rationales clearly.
Vetting Process
- Submit your application and complete the Interest Form to confirm your readiness for the assessment.
- Participate in a 20-minute AI interview via our platform, QODE.
- Complete a 30–45 minute automated coding challenge.
Target Companies for Candidate Eligibility
Google (Alphabet), Apple, Amazon, Meta (Facebook), Netflix, Microsoft, Tesla, NVIDIA, Adobe, Salesforce, GitHub, Atlassian, HashiCorp, Databricks, Snowflake, Cloudflare, DigitalOcean, MongoDB, Elastic, Confluent, Airbnb, Dropbox, Stripe, Palantir, Uber, Lyft, Square (Block), Twilio, Snap Inc., Pinterest, Figma, Oracle, Cisco, PayPal, DoorDash, Rivian, Reddit, Coinbase, Splunk, Spotify, Goldman Sachs, Morgan Stanley, JPMorgan Chase, Capital One, Plaid, Shopify, Intuit, Workday, ServiceNow, Hugging Face, VMware, Brex, Wise, Epic Games, Unity Technologies, Activision Blizzard, Riot Games, Valve, Huawei, Bloomberg, ByteDance, Alibaba, Baidu, Notion, Klarna, Instacart, Zillow.