Grafana Workflow Specialist & AI Evaluator at Mercor | Torre

Grafana Workflow Specialist & AI Evaluator

You'll shape AI's ability to master Grafana by designing expert-level evaluation tasks.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Freelance
A project
Compensation
USD90 - 150/hour
Negotiable
location_on
Remote (anywhere)
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Posted 8 days ago

Responsibilities and deliverables


Overview: - We are looking for experienced Grafana power users to design expert-level evaluation tasks that test whether AI agents can use Grafana the way a real professional does. - Your domain expertise is what makes these tasks authentic. What You'll Do: - Design realistic, multi-step Grafana workflows, including dashboards, alerting rules, data source configuration, panel setup, and cross-module operations. - Perform each workflow yourself on a hosted Grafana instance to produce a reference trajectory. - Write clear, specific task prompts with measurable outcomes that can be verified programmatically. - Implement programmatic graders that check whether each instruction was completed correctly. - Review AI agent attempts at your tasks, identify where and why they fail, and tag root causes. - Calibrate task difficulty so tasks are challenging but solvable, iterating on prompts and constraints based on model performance. Requirements: - 2+ years of daily, professional Grafana experience (SRE, Platform Engineering, Observability, or similar). - Deep familiarity with PromQL, dashboard templating, alerting pipelines, and data source configuration (Prometheus, InfluxDB, etc.). - Ability to articulate workflows clearly enough for programmatic verification. - Comfort writing basic grading scripts (Python; engineering support provided as needed). Nice to Have: - Experience with Grafana API automation. - Kubernetes/infrastructure monitoring background. - Familiarity with AI evaluation or benchmarking. Time Commitment: - 10-15 hours/week minimum during the project. - Fast turnaround expected; responsiveness matters. Equal Opportunity Employer: - We consider all qualified applicants without regard to legally protected characteristics and provide reasonable accommodations upon request.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.