AI Evaluation Specialist at Kasama | Torre

AI Evaluation Specialist

You'll shape AI accuracy by meticulously evaluating outputs and refining system instructions.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Freelance
A project
Compensation
USD700 - 2.5K/month
Negotiable
location_on
Remote (anywhere)
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted about 1 month ago

Responsibilities and deliverables


RESPONSIBILITIES ● Carefully read and understand detailed AI system instructions, evaluation criteria, and task guidelines. ● Analyze and compare multiple AI-generated responses to determine which better follows the instructions. ● Evaluate outputs based on accuracy, completeness, logic, clarity, and alignment with requirements. ● Identify issues such as missed instructions, inconsistencies, hallucinations, or technical errors. ● Provide clear, structured written explanations that justify evaluation decisions. ● Document findings in the required format, ensuring accuracy and consistency. ● Work with structured data formats (such as CSV or JSON) to review and record evaluation results. ● Apply critical thinking and technical reasoning rather than relying on assumptions or surface-level judgments. ● Follow quality standards and feedback to continuously improve evaluation accuracy. ● Maintain confidentiality and professionalism when handling proprietary materials. REQUIREMENTS ● Strong English reading comprehension and writing ability, with attention to nuance and detail. ● Background in a technical field such as Information Technology, Computer Science, Engineering, DevOps, or a related discipline. ● Ability to interpret complex written instructions and apply them precisely. ● Excellent analytical and logical reasoning skills. ● Experience working with structured data formats like CSV or JSON. ● Capability to write concise, human-sounding explanations that clearly communicate reasoning. ● High level of accuracy, focus, and quality control when reviewing content. ● Self-discipline to work independently in a remote, task-based environment. ● Reliable internet connection and ability to meet task deadlines.

Indefinitely open

Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.