Senior Platform Engineer, Ingestion at LangChain | Torre
warning

Heads-up

The job you’re trying to post already exists in Torre:

Senior Platform Engineer, Ingestion

You'll define AI observability at production scale, building critical systems for ubiquitous intelligent agents.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Remote (for Netherlands residents)
Remote (for Germany residents)
Remote (for United Kingdom residents)
Remote (for Sweden residents)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Shared by
Emma of Torre.ai
13 days ago

Requirements and responsibilities


About UsAt LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown to also offer a platform for building, evaluating, deploying, and operating agents at scale.With $125M raised at Series B from IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we’re at a stage where we’re continuing to develop new products, growth is accelerating, and all team members have meaningful impact on what we build and how we work together. LangChain is a place where your contributions can shape how this technology shows up in the real world.Today, our platform includes LangSmith (Observability, Evaluation, Deployment, Fleet, and Sandboxes), our open source frameworks (LangChain, LangGraph, and Deep Agents), and the newly launched LangSmith Engine for autonomous agent improvement. We have 100M+ monthly open source downloads, 6,000+ active LangSmith customers, and 5 of the Fortune 10 use LangSmith in production (+ 35% of the Fortune 500 overall), including teams at Klarna, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, LinkedIn, Monday.com, Nvidia, and Bridgewater.About the teamThe LangSmith team owns and builds LangChain's core platform for observability, evaluation, and production reliability of AI systems. From tracing and annotation to run rules, evaluations, and beyond, this team owns LangSmith end-to-end. If you want to define what great AI observability looks like at production scale, this is where that work gets done.About the roleThis role sits at the core of LangSmith: you'll own the ingestion systems, query systems, and the API, SDK, and CLI surfaces that thousands of development teams use every day. You'll work at the intersection of distributed systems and developer experience, on infrastructure that teams across the industry depend on.What you'll do:Build and scale critical systems: design and operate high-throughput, data-intensive ingestion and trace-query systems supporting LangSmith, built on SmithDB, our purpose-built database for agent observability. Build monitoring, alerting, and automated recovery so the pipeline stays resilient.Set API, SDK, and CLI standards: define and enforce the standards, tooling, and CI that power SDK generation across Python, TypeScript, Go, and Java; keep our developer surfaces consistent, high-quality, and self-served across feature teams.Own integrations: build new integrations and maintain existing ones so it's easy to use LangSmith with any AI framework, agent, or tool — keeping us framework-agnosticSolve complex problems: debug performance bottlenecks, optimize database queries, and architect solutions for distributed-system challengesRespond to incidents: participate in an on-call rotation focused on post-incident learning, automation, and preventionHow to be successful in this role:Many of these will apply to you — we don't expect every box checked.Platform engineering: hands-on experience designing and running data-intensive systems at scaleDeveloper experience: a track record of building high-quality, widely-adopted CLIs, SDKs, or API standards that developers actually enjoy usingDatabase expertise: production experience with OSS datastores (PostgreSQL, Redis)Backend languages: Strong backend software engineering skills with production-level experience in Go, Python, or TypeScript.Infrastructure expertise: solid knowledge of cloud object storage, Kubernetes, containerized infrastructure, and cloud platforms (GCP, AWS)Observability mastery: hands-on experience with observability stacks (Datadog, Prometheus/Grafana, OpenTelemetry, or similar)Operational mindset and high agency: "you build it, you run it, you own it," with a focus on sustainable practicesNice to Have:Experience: 5+ years building and operating production systems, developer-facing APIs, or bothStrong experience with JavaKnowledge of columnar file, memory formats and OLAP databasesBackground in high-growth startupsLocation: This role is fully remote within Europe, excluding France.Compensation & BenefitsWe offer competitive compensation that includes base salary, meaningful equity, and benefits such as health and dental coverage, flexible vacation, a 401(k) plan, and life insurance. Actual compensation will vary based on role, level, and location. For team members in the EU and UK, we provide locally competitive benefits aligned with regionalCompensation Philosophy:We offer competitive compensation that includes base salary, variable compensation for relevant roles, meaningful equity, benefits, and perks. Actual compensation and offerings will vary based on role, level, and location. Team members in the EU, UK, and APAC receive locally competitive benefits aligned with regional norms and regulations.BenefitsBenefits include medical, dental, and vision coverage, flexible vacation, a 401(k) plan, meals on in-office days in the US and more.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.