AI Model Evaluator (LLM & Agent Systems) at Micro1 | Torre

AI Model Evaluator (LLM & Agent Systems)

You'll shape AI quality to match 1 billion people with their dream roles.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Hybrid (United States)
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted about 2 months ago

Requirements and responsibilities


Evaluate and benchmark LLM and agent system outputs. Design test cases and evaluation frameworks. Analyze reasoning, hallucinations, bias, and safety issues. Provide structured feedback to improve model quality. Contribute to prompt engineering and performance optimization. Document findings and communicate insights clearly.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.