Senior Engineer - Kubernetes at Datum | Torre
warning

Heads-up

The job you’re trying to post already exists in Torre:

Senior Engineer - Kubernetes

You'll architect and scale open-source cloud control planes, empowering 1k clouds in the AI era.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Provide your expected compensation while applying
location_on
Remote (for United States residents)
Remote (for United Kingdom residents)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Shared by
Emma of Torre.ai
2 months ago

Requirements and responsibilities


About the roleDatum’s mission is to help 1k clouds thrive in the AI era by unlocking internet superpowers for every builder. We’re working in the open to bring the foundational capabilities that all the big guys use (private networking, peering, direct interconnection, etc) into the hands of builders and modern “alt clouds” — no network team required.One of Datum’s core values is to be connectors: of applications, services, networks, and people. As such, this role will work directly with users, customers, partners, and the broader community.Another key value is to be open by default, from how we license our code (AGPLv3) to how we communicate, engage with, and document our work.The RoleWe're seeking a Senior+ Engineer to build and run critical components of the Datum Cloud control plane. This is a senior technical leadership role focused on designing and building features woven into our open source business operating system, Milo, a toolkit for modern AI-forward alt-clouds.You’ll work extensively with distributed systems, vendor APIs, networking protocols, software-defined networking, and cloud-native infrastructure while solving complex orchestration challenges across multiple cloud providers and edge locations. This role requires deep Kubernetes expertise, hardened by production operation, combined with a passion for open-source development and building systems that other engineers love to use.What You'll DoControl Plane Infrastructure & ArchitectureDesign, implement, and run Datum's core orchestration stackBuild customer-facing solutions to help our alt-cloud ecosystem thriveScale the management, monitoring, and metering of our edge locationsPartner with leadership to advance projects with key customers, partners, and suppliersDistributed Systems & PerformanceDesign distributed solutions that scale from startup to hyperscale usage patternsImplement intelligent traffic routing, load balancing, and failoverBuild observability, monitoring, and diagnostic tools for complex environmentsOptimize control plane performance for AI workloads and high-bandwidth applications with our network teamOpen Source LeadershipDrive technical networking decisions in collaboration with our open-source communityReview and mentor contributions from external developers on networking componentsMaintain high code quality standards and documentation for network APIsRepresent Datum at conferences and in technical working groupsCloud-Native & AI IntegrationDesign networking solutions that integrate seamlessly with Kubernetes and AI patternsBuild network policies and security frameworks for multi-tenant cloud environmentsImplement service mesh integration and east-west traffic optimizationEnsure compatibility with major cloud provider networking services (AWS, GCP, Azure)About YouDistributed Systems6+ years of large-scale production systems running Kubernetes with security as a first principleStrong experience with Kubernetes patterns and APIs, having written custom resources, controllers, and preferably exposure to kubebuilderStrong experience with distributed systems design, security, auth, consensus algorithms, async reconciliation, and fault toleranceExperience modeling data in Kubernetes, or transferable knowledge from RDBMS, GraphQL, information retrievalCloud & Infrastructure ExperienceExtensive experience with multi-cloud networking and hybrid cloud connectivityDeep knowledge of Kubernetes networking, CNI plugins, and service mesh architecturesExperience with infrastructure as code (Flux, Terraform, Pulumi) for provisioningUnderstanding of edge computing, CDN architectures, and global traffic managementFamiliarity with SRv6, eBPF, DPDK, VPP, mpTCP and other advanced networking technologies would be a huge plusOpen Source & LeadershipTrack record of contributing to or maintaining networking-focused open-source projectsExperience mentoring engineers and driving technical decision-making in teamsUnderstanding of open-source governance, community building, and public developmentPassion for building networking tools that other developers and operators love to useTechnology StackLanguages: Go, RustData: PostgreSQL, GraphQL, Elasticsearch, MeilisearchInfrastructure: Kubernetes, Flux, PulumiCloud Platforms: Cloudflare, AWS, GCP, Azure, multi-cloud networkingMonitoring: Prometheus, Grafana, OpenTelemetry, network flow analysisDevelopment: GitHub, CI/CD, automated testing, network simulationOpen Source CommitmentThis role involves significant public development work. You’ll be:Contributing to Datum's public networking repositories with transparent developmentEngaging with the community through GitHub issues, RFCs, and technical discussionsSpeaking at networking conferences and writing technical blog postsCollaborating with external contributors, cloud providers, and other partnersMaintaining high standards for code quality, performance, and documentationWhat Success Looks LikeAdoption and growth for Datum in the cloud-native and AI infrastructure communitiesHigh-performance, reliable network connectivity across diverse cloud environmentsStrong developer experience as evidenced by community contributions and feedbackTechnical leadership recognized within the networking and distributed infra ecosystemScalable network architecture supporting the next generation of AI hyperscalersWe believe in openness, clarity, and collaboration. To learn more about how Datum aims to operate, please review our public handbook.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.