Senior ML Engineer – Image/Video Segmentation at Sagan Recruitment | Torre

Senior ML Engineer – Image/Video Segmentation

You'll develop high-performance ML pipelines for image and video segmentation in film production
Full-time

Legal agreement: Contractor

Currency exchange and taxes to be paid by:

Candidate

Compensation
USD 6K - 7K/month
Non-negotiable
Remote (for Brazil residents)
Remote (for Mexico residents)
Remote (for Argentina residents)
Remote (for Philippines residents)
Posted 6 months ago

Requirements and responsibilities


Job title: Senior ML Engineer – Image/Video Segmentation
Location: Remote (Preference for LATAM or CEE)
Salary Range: 6000 - 7000 USD/month
Work Schedule: Monday - Friday, Full-time, with at least 4 hours overlap in EST
NOTE: INDEPENDENT CONTRACTOR POSITION

Company Overview:
Sagan is an exclusive membership community for top executives, founders, and CEOs seeking to hire and maximize the impact of international talent. We bridge the gap between global talent and US-based businesses, connecting candidates from vibrant regions like Latin America, the Philippines, India, Pakistan, Bangladesh, and Africa with leading American companies. Sagan provides a high-performance remote work environment, ensuring access to world-class opportunities for top-tier professionals.

About the Company:
Our member is an AI-driven post-production technology company founded by two accomplished film industry veterans: an award-winning colorist and a Sundance-winning cinematographer. They’ve built a patented machine learning system that automates complex rotoscoping and image segmentation processes for major film and television productions. Their technology is already being used by top Hollywood studios and streaming platforms, accelerating visual-effects pipelines and reducing turnaround times from weeks to hours. The team operates from a TPN-certified facility and is known for combining deep technical expertise with a strong creative culture.

This is a small, tight-knit, remote-first team where collaboration, ownership, and curiosity are deeply valued. The founders work directly with every engineer and artist, fostering an environment where innovation and trust go hand in hand. They’re currently expanding their engineering team to scale their machine learning infrastructure, particularly around GPU optimization and PyTorch/CUDA deployment, as they bring new projects and funding online.

The ideal candidate is excited by the intersection of AI and filmmaking, thrives in fast-moving startup environments, and takes pride in writing production-grade code that directly impacts world-class visual content.

Position Overview:
We’re seeking a Senior Machine Learning Engineer with deep expertise in image and video segmentation to drive the development and deployment of high-performance computer vision systems. You'll work closely with a small team to build scalable pipelines optimized for GPU infrastructure, using modern deep learning techniques and production-grade code.

Key Responsibilities:
Design and develop deep learning models for image and video segmentation, including temporal tracking.
Build, optimize, and maintain production-ready inference pipelines using PyTorch and CUDA.
Migrate legacy systems to GPU-accelerated infrastructure for real-time performance.
Contribute to MLOps tooling and CI/CD workflows for model training and deployment.
Collaborate with internal stakeholders to translate product needs into ML solutions.
Ensure clean, efficient, and well-documented code following software engineering best practices.

Qualifications:
5+ years of experience in Machine Learning, Computer Vision, or related fields.
Proven experience with image and video segmentation tasks in production environments.
Advanced proficiency with PyTorch and CUDA.
Experience building and deploying models at scale on GPU infrastructure.
Solid understanding of temporal tracking, optical flow, or related video-based modeling techniques.
Native-level fluency in English, both spoken and written.

Nice-to-Haves:
Strong C++ skills, especially in performance-sensitive vision applications.
Experience with FastAPI or similar frameworks for serving ML models.
Familiarity with media production workflows or video processing pipelines.
Exposure to transformer-based models for segmentation or tracking.
Previous experience in startup environments or R&D-heavy teams.
Comfort working independently with high autonomy.