Sr/Staff Lustre Engineer at DDN | Torre

Sr/Staff Lustre Engineer

You'll shape the future of high-performance Lustre, optimizing petabyte-scale environments and upstreaming critical features.
Emma highlights
This highlight was written by Emma’s AI. Ask Emma to edit it.
Full-time

Legal agreement: Employment

Compensation
USD150k - 250k/year
location_on
Remote (for United States residents)
Match
skeleton-gauges
You have opted out of job matches in .
To undo this, go to the 'Skills and Interests' section of your preferences.
Review preferences
Shared by
Emma of Torre.ai
about 11 hours ago

Requirements and responsibilities


If you’ve spent years deep in the Lustre codebase not just administering Lustre systems, but actually designing, building, optimizing, and upstreaming features — this is a chance to work on one of the most demanding storage engineering problems in the market.We’re looking for a hands-on Lustre Engineer to help shape the next generation of high-performance, enterprise-ready Lustre capabilities. This role is for someone who is equally comfortable working in kernel-space C, analyzing performance bottlenecks in large-scale deployments, and collaborating with other senior engineers to evolve Lustre architecture for modern infrastructure.What you’ll doContribute to Lustre architecture and lead implementation of new features across core subsystems including MDS/OSS, LDLM, DNE, DoM, FLD, layout, snapshot, quota, security, and recoveryWrite high-performance, production-grade code across the Lustre kernel module, user-space utilities, and supporting librariesOwn the full development lifecycle for major features, from design and coding through testing, performance tuning, and upstream patch submissionImprove scalability for multi-petabyte Lustre environments, including metadata performance, I/O path optimization, and integration with modern interconnects and storage fabricsModernize Lustre internals through kernel API updates, locking improvements, RDMA enhancements, and cloud/container-oriented capabilitiesPrototype and evaluate emerging technologies such as NVMe-oF, CXL, and other next-generation storage approaches for potential Lustre integrationStrengthen regression, stress, and performance validation using tools such as IOR, mdtest, fio, and custom Lustre test toolingPerform deep code-level performance analysis using Lustre tracing tools, perf, lockstat, and kernel profilersWork closely with Principal Engineers, hardware teams, and application owners to translate real workload needs into practical Lustre improvementsMentor other developers and contribute design docs, technical guides, and engineering reviewsWhat we’re looking for7+ years of progressive, hands-on Lustre software development experienceA strong track record of meaningful contributions to the Lustre codebase in production or open-source environmentsDemonstrated success designing, implementing, and upstreaming Lustre features that made it into major releasesDeep understanding of Lustre internals, including MDS/OSS architecture, LDLM locking and recovery, journaling, DNE, DoM, FLD, layout, quota, and securityExpert-level C programming and strong Linux kernel development experienceStrong knowledge of distributed systems, parallel file systems, and high-performance I/OExperience with Lustre build, development, and debugging tooling such as lctl, llstat, strace, perf, and kernel tracersPython and/or Bash skills for tooling and automationBachelor’s or Master’s degree in Computer Science, Computer Engineering, or a related fieldClear written and verbal communication skills, with the ability to explain complex technical concepts effectivelyNice to have7+ years in HPC or storage software developmentVisible contributions to the Lustre open-source community, including accepted patches, LUG participation, or public technical talksExperience running or developing Lustre in cloud environments such as AWS FSx for Lustre, Azure Lustre, or custom cloud deploymentsFamiliarity with adjacent systems such as Ceph, DAOS, BeeGFS, WekaIO, or GPFSExposure to RDMA, NVMe-oF, CXL, computational storage, or AI/ML I/O workloadsConference presentations, technical publications, or other public engineering contributionsYou’ll likely be a strong fit if you…Have built inside Lustre itself, not just around itEnjoy solving hard kernel, performance, and distributed systems problemsCare about code quality, upstream contribution, and long-term architectural impactWant to work on infrastructure that operates at real scale under real performance constraintsThis role is probably not the right fit if you…Have mainly supported or administered Lustre rather than developed itCome from general storage or kernel backgrounds without clear Lustre code contribution historyPrefer high-level application engineering over low-level systems and kernel workSalary Range: $150,000 - $250,000Why this role is interestingThis is a rare opportunity to work on deeply technical, high-impact storage engineering problems in a domain where performance, scale, reliability, and upstream credibility all matter. If you want to influence the future of Lustre in production-grade environments, this is the kind of role where your work can genuinely shape the platform.Call to actionIf you’ve made meaningful contributions to Lustre and want to work on large-scale filesystem innovation, we’d love to hear from you.
Optionally, you can add more information later (benefits, pre-screening questions, etc.)
check_circle

Payment confirmed

A member of the Torre team will contact you shortly

In the meantime, continue adding information to your job opening.