Principal AI Platform Engineer at Lynx | Torre

Principal AI Platform Engineer

You'll architect and own a secure AI platform, transforming mission-critical edge systems for global impact.
Full-time

Legal agreement: Employment

Compensation
USD 190k - 225k/year
Remote (for United States residents)
Shared by
Emma of Torre.ai
about 10 hours ago

Requirements and responsibilities


Who we are

Lynx delivers modular, open standards–based software that transforms how high-assurance, mission-critical edge systems are built, deployed, and maintained. Our secure edge computing solutions enable innovation and operational excellence in the world’s most demanding environments, from aerospace and defense to commercial and industrial systems. We partner across industries including automotive, medical, and critical infrastructure to deliver tailored solutions aligned with each customer’s mission and operational requirements.

Our key products and services are:

- MOSA.ic: LYNX MOSA.ic™ is a modular software framework and architecture purpose-built for mission-critical edge computing. Based on the Modular Open Systems Approach (MOSA), it provides a flexible foundation for building secure, scalable, and certifiable edge systems.
- LYNX MOSA.ic.AI: LYNX MOSA.ic.AI is a unified CPU and GPU software platform that enables deterministic, certifiable deployment of AI and advanced workloads in mission-critical edge systems. It brings control, performance, and lifecycle governance together, allowing AI to operate predictably within safety-critical environments without compromising certification or system integrity.
- CoreSuite 2.0: CoreSuite 2.0 is Lynx’s safety-critical framework for GPU graphics enablement, designed for mission-critical edge computing systems. It provides hardware-accelerated graphics, visualization, and video processing capabilities that can be certified for high-assurance systems.
- Services: Lynx Services is Lynx’s professional services organization that helps customers design, integrate, certify, deploy, and maintain safety- and security-critical systems. It supports industries like aerospace, defense, automotive, and industrial computing through consulting, engineering, integration, and lifecycle support, reducing development risk and accelerating certification in standards-driven, mission-critical environments.

Role Overview

This role calls for a builder-architect: someone who can take multiple partially mature AI tools and make them operate like one disciplined platform. The right person should be equally comfortable with engineering architecture, backend integration, cloud infrastructure, LLM tooling, and production hardening.

- AI workflow orchestration with LangChain / LangGraph or equivalent frameworks
- LLM observability, prompt/version management, and evaluation systems such as Langfuse
- Azure platform engineering using Container Apps, PostgreSQL, Key Vault, Entra ID, private networking, and monitoring
- Secure backend and API integrations with systems such as CodeBeamer, GitHub, and webhook-driven workflows
- Production hardening through infrastructure as code, CI/CD, testing, rollback, rate limiting, security controls, and auditability
- Regulated-workflow thinking, where traceability, human-in-the-loop review, and controlled change management matter as much as model quality

Mission for the role

Own the AI platform as the engineering backbone for AI-assisted certification and engineering workflows. This person should make the platform secure, stable, measurable, and extensible so that new AI tools can be built and operated with confidence.

Key responsibilities

- Define and enforce the platform standard for how AI tools use orchestration frameworks, prompt assets, tracing, and metadata
- Bring existing advanced tools into alignment with shared platform conventions while preserving important agentic or workflow-specific behavior
- Build and maintain Azure-based production infrastructure, including networking, identity, secrets, storage, database, monitoring, and deployment patterns
- Implement infrastructure as code and CI/CD for sandbox-to-production promotion
- Deepen LLMOps capabilities, including prompt versioning, golden datasets, automated evaluations, cost tracking, feedback loops, regression detection, and release controls
- Own secure integrations with CodeBeamer, GitHub, and event-driven APIs or webhooks
- Establish operational discipline through logging, alerting, rollback, test coverage, runbooks, rate limiting, and supportability
- Partner with engineering, IT, security, and compliance stakeholders to support auditable AI-assisted workflows
- Own and evolve the AI platform to provide a standard, secure approach to accessing AI-assisted capabilities across the organization for certification workflows
- Mentor and coach other senior and intermediate engineers on the team, provide technical guidance, and conduct architectural reviews of trade-offs
- Help define the technical trajectory of the platform and AI tools

Qualifications

- 10+ years of relevant experience
- Bachelor’s degree in an engineering-related discipline preferred
- Strong Python backend engineering and API integration experience
- Strong Azure platform experience, especially Container Apps, VNet/private endpoints, Entra ID, Managed Identity, Key Vault, PostgreSQL, ACR, and monitoring
- Hands-on experience with LLM application frameworks such as LangChain, LangGraph, or close equivalents
- Hands-on experience with LLM observability or evaluation tooling such as Langfuse or equivalent tracing and eval systems
- Experience building CI/CD and infrastructure as code with Terraform, Bicep, GitHub Actions, Azure DevOps, or comparable tools
- Experience securing internal platforms with RBAC, secrets management, service-to-service auth, webhook validation, rate limiting, and audit logging
- Ability to design reliable multi-step or agentic workflows, including retries, state handling, guardrails, and output validation
- Strong operational judgment around testing, rollback, monitoring, alerting, documentation, and runbooks
- Must be a US Citizen

Strongly preferred

- Experience in regulated, safety-critical, aerospace, defense, medical, or similarly controlled environments
- Familiarity with DO-178C-style traceability, auditability, formal review workflows, or human-in-the-loop approval requirements
- Experience integrating with CodeBeamer, GitHub Enterprise, Jira, or similar enterprise engineering systems
- Familiarity with C/C++ code analysis or test-generation workflows
- Experience with prompt governance, change control, and evaluation datasets
- Some comfort with internal-tool UI work such as React, though this should remain secondary to platform, backend, and infrastructure strength

Sound exciting? Get in touch today!

We have very robust benefits including:

- Low-cost Medical / Dental / Vision coverage options
- 401K with generous employer match
- Responsible Paid Time Off + Paid Holidays
- Remote work opportunities based on role
- Employee Assistance Program (EAP)
- Career growth and professional development opportunities

All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, or veteran status.
