Member of Technical Staff, Coding Research at Micro1 | Torre

Member of Technical Staff, Coding Research

You'll shape frontier AI by designing benchmarks and evaluating next-generation coding agents.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Compensation
USD220k - 500k/year
location_on
Remote (anywhere)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Shared by
Emma of Torre.ai
3 days ago

Requirements and responsibilities


The RoleWe are seeking a Member of Technical Staff to help advance the evaluation and development of frontier coding agents. Sitting at the intersection of AI research, software engineering, and model evaluation, you will design the benchmarks, methodologies, and data systems that shape how next-generation coding models are measured and improved.What You'll DoDesign and own evaluation frameworks for coding agents, including benchmark specifications, scoring methodologies, rubrics, and quality standards.Lead end-to-end research initiatives focused on measuring and improving coding model performance across diverse software engineering tasks.Develop high-quality datasets, golden examples, and evaluation protocols that enable reliable assessment of frontier coding systems.Analyze model behavior and failure modes, identifying systematic weaknesses and translating findings into actionable improvements for training and evaluation.Build tooling and infrastructure that support large-scale experimentation, data generation, review workflows, and evaluation pipelines.Establish best practices for coding-agent assessment, ensuring methodological rigor, reproducibility, and measurement quality.Partner closely with researchers, engineers, and applied AI teams to design experiments and evaluate emerging model capabilities.Contribute to technical reports, benchmark studies, and client-facing research initiatives that communicate model performance and insights.What We're Looking ForStrong software engineering background with expertise in Python, C++, or comparable programming languages.3+ years of experience in software engineering, machine learning, AI research, evaluation, or related technical disciplines.Experience designing, reviewing, or validating technical assessments, benchmarks, coding tasks, or evaluation methodologies.Familiarity with large language models, coding agents, reinforcement learning, model evaluation, or related AI systems.Proven ability to build tooling, automate workflows, and improve technical processes through systematic experimentation.Strong analytical skills with the ability to investigate model behavior and derive insights from complex technical systems.Excellent written and verbal communication skills, including the ability to clearly articulate technical findings to diverse audiences.Comfortable operating in fast-moving research environments with significant ambiguity and evolving priorities.PreferredExperience working on frontier AI systems, coding agents, or model evaluation research.Deep interest in understanding how data, evaluations, and feedback mechanisms influence model capabilities.Track record of independently driving ambiguous technical or research projects from conception to execution.Experience designing benchmarks or datasets for machine learning systems at scale.Familiarity with agentic workflows, tool use, reinforcement learning, or post-training methodologies.Publications, open-source contributions, or demonstrated technical leadership in AI, machine learning, or software engineering.Compensation & Benefits NoticeThe national pay range for this full-time position is base salary of $140,000 –$180,000 USD. All employees are eligible for equity compensation, and employees may also receive performance-based bonuses, dependent on role and subject to company policies. micro1 provides a comprehensive benefits package, including up to 100% reimbursement for health-insurance premiums, paid time off, a 401(K) plan with a company match, and additional benefits designed to support a high-performing, remote-first workforce.micro1 is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex (including pregnancy, sexual orientation, or gender identity), national origin, age, disability, genetic information, veteran status, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation during the application process, reach out to support@micro1.ai.Our hiring process utilizes artificial intelligence tools to assist in candidate screening and assessment. Our AI tools are designed to complement, not replace, human decision-making.DisclaimerThe information contained in this job posting, including but not limited to role responsibilities, qualifications, compensation, and benefits, is provided for informational purposes only and does not constitute a binding offer of employment. micro1 reserves the right to amend, modify, or withdraw any portion of this posting at its sole discretion and without prior notice. All employment decisions are made in accordance with applicable laws and regulations.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.