Solutions Engineer, Media at Protege | Torre

Solutions Engineer, Media

You'll shape the future of AI by curating critical training data, driving impact across product and customers.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Compensation is to be agreed upon.
location_on
Remote (anywhere)
Shared by
Emma of Torre.ai
4 days ago

Requirements and responsibilities


Company Overview:We are building Protege to solve the biggest unmet need in AI — getting access to the right training data. The process today is time intensive, incredibly expensive, and often ends in failure. The Protege platform facilitates the secure, efficient, and privacy-centric exchange of AI training data.Solving AI’s data problem is a generational opportunity. We’re backed by world-class investors and already powering partnerships with some of the most ambitious teams in AI. The company that succeeds will be one of the largest in AI — and in tech.We’re a lean, fast-moving, high-trust team of builders who are obsessed with velocity and impact. Our culture is built for people who thrive on ambiguity, own outcomes, and want to shape the future of data and AI.Role OverviewWe’re hiring a Solutions Engineer for our media vertical to connect Protege’s media catalog with customer AI data needs. This is not a traditional modeling role. It is an applied data curation and delivery role for fast-moving, ambiguous environments where both speed and quality matter.You will work with imperfect, evolving partner datasets and build strategies to normalize, validate, and operationalize them for downstream AI use cases. You’ll become an expert in Protege’s growing catalog of audio, video, and motion capture content — from longform assets with title-level metadata to clip-level content generated with TwelveLabs embeddings.At a high level, you will understand what customers are building, identify the content that best fits their needs, and deliver datasets that meet both technical and conceptual requirements, often on tight timelines tied to active deals.What You’ll DoOwn data quality and curate media datasetsPartner with Sales and Solutions to translate customer requirements into curation strategiesWork with imperfect partner data, including mismatched metadata, schema differences, and incomplete labelingNormalize and standardize datasets for reliable downstream useQuery and analyze Protege’s media catalog using SQL, internal APIs, and metadata tools to identify relevant contentBuild validation checks and workflows to ensure dataset integrity before deliveryIdentify, debug, and resolve data quality issues across file structures, metadata, and content alignmentUse AI tools and transcoded embeddings to surface and refine clip-level contentTurn messy, real-world data into structured datasets that meet customer and model requirementsRun iterative sample reviews with customers, incorporate feedback, refine selections, and ensure final packages meet specBe the catalog expertBuild deep expertise in Protege’s media catalog structure, metadata, and growth patternsTrack content coverage, diversity, and modality mix, and identify gaps relative to customer demandPartner with Product and Partnerships to share catalog insights that inform sourcing prioritiesOperate across product, data, and customerWork cross-functionally to ensure content packaging meets technical, ethical, and licensing requirementsDevelop methods, scripts, and internal tools that improve curation efficiency and scaleHelp shape Protege’s delivery platform, including how internal users and customers search, sample, and export dataDrive human-in-the-loop media search and curationWork closely with embedding-based systems to iterate between algorithmic selection and human reviewDefine best practices for embedding queries, relevance evaluation, and content diversityMaintain a high bar for operational excellence and quality assurance throughout the processWhat Success Looks Like30 days: Learn and get operationalBuild a working understanding of the media catalog, delivery lifecycle, and core tools.Establish strong cross-functional relationships and shadow live curation workflows.60 days: Deliver and improveLead dataset sampling and curation for active use cases, and document reusable workflows.Surface early insights on catalog coverage, metadata quality, and process improvements.90 days: Scale and influenceCreate repeatable QA and delivery workflows that increase consistency and speed.Provide actionable feedback that shapes platform, sourcing, and catalog roadmap decisions.What You Bring4-7 years of experience in data science, media analytics, technical curation, or similarly hands-on data roles.Strong SQL proficiency and comfort querying large, messy datasets to generate insight and action.Experience working with media metadata, embeddings, or unstructured content.Ability to translate nuanced customer or model requirements into concrete dataset specifications.High standard for data quality, operational rigor, and usability of delivered outputs.Clear communicator who can move between technical depth and customer-friendly clarity.Thrive in ambiguous, fast-moving environments and treats teammates with kindness.Bonus if you also have:Familiarity with video/audio processing, embeddings, or multimodal AI workflows.Prior experience curating or packaging datasets for machine learning.Background in content analysis, recommendation systems, or information retrieval.Protege ValuesPass the Loved Ones’ TestWe act with integrity and do the right thing — especially when it’s hard and no one is watching.Always Find a WayWe are resourceful, resilient builders who solve hard problems and push through obstacles.Go Fast and Grow FastVelocity matters. We move with urgency, learn quickly, and continuously improve as individuals and as a company.Practice Kindness and CandorWe communicate directly and respectfully, building trust through honest feedback and genuine care for one another.Deliver TogetherWe win as one team. Collaboration, accountability, and shared ownership drive our success.Own the Outcome. Hone the Craft.We take pride in our work, sweat the details, and continuously raise the bar for excellence.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.