AI Evaluation Specialist at Kasama | Torre
warning

Heads-up

The job you’re trying to post already exists in Torre:

AI Evaluation Specialist

You'll shape AI accuracy by meticulously evaluating outputs and refining system instructions.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Freelance
A project
Compensation
USD700 - 2.5k/month
Negotiable
location_on
Remote (anywhere)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted 5 months ago

Responsibilities and deliverables


RESPONSIBILITIES ● Carefully read and understand detailed AI system instructions, evaluation criteria, and task guidelines. ● Analyze and compare multiple AI-generated responses to determine which better follows the instructions. ● Evaluate outputs based on accuracy, completeness, logic, clarity, and alignment with requirements. ● Identify issues such as missed instructions, inconsistencies, hallucinations, or technical errors. ● Provide clear, structured written explanations that justify evaluation decisions. ● Document findings in the required format, ensuring accuracy and consistency. ● Work with structured data formats (such as CSV or JSON) to review and record evaluation results. ● Apply critical thinking and technical reasoning rather than relying on assumptions or surface-level judgments. ● Follow quality standards and feedback to continuously improve evaluation accuracy. ● Maintain confidentiality and professionalism when handling proprietary materials. REQUIREMENTS ● Strong English reading comprehension and writing ability, with attention to nuance and detail. ● Background in a technical field such as Information Technology, Computer Science, Engineering, DevOps, or a related discipline. ● Ability to interpret complex written instructions and apply them precisely. ● Excellent analytical and logical reasoning skills. ● Experience working with structured data formats like CSV or JSON. ● Capability to write concise, human-sounding explanations that clearly communicate reasoning. ● High level of accuracy, focus, and quality control when reviewing content. ● Self-discipline to work independently in a remote, task-based environment. ● Reliable internet connection and ability to meet task deadlines.

Indefinitely open

tune NOT FOR YOU? IMPROVE YOUR RESULTS
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.