AI Model Evaluator (LLM & Agent Systems) at Micro1 | Torre

AI Model Evaluator (LLM & Agent Systems)

You'll refine cutting-edge AI, directly shaping the future of human potential and career matching.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Hybrid (United States)
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted about 2 months ago

Requirements and responsibilities


Role involves evaluating outputs from large language models (LLMs) and autonomous agent systems against defined guidelines and rubrics, reviewing multi-step agent actions with supporting materials, applying evaluation standards, providing structured feedback for benchmarking and product refinement, participating in calibration sessions, adapting to evolving scenarios, and documenting findings for stakeholders.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.