AI Model Evaluator (LLM & Agent Systems) at Micro1 | Torre

AI Model Evaluator (LLM & Agent Systems)

You will shape the future of AI by refining LLM and agent systems for human intelligence.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Hybrid (United States)
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted about 2 months ago

Requirements and responsibilities


Evaluate outputs from large language models (LLMs) and autonomous agent systems against defined guidelines and rubrics. Review multi-step agent actions, including reasoning traces, to determine accuracy and quality. Provide detailed, structured feedback to inform benchmarking, product evolution, and model refinement. Participate in calibration sessions to ensure consistent evaluation criteria. Document findings and communicate insights clearly to relevant stakeholders.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.