
Posted a day ago
Senior Storage Engineer
Hydra HostSenior Storage Engineer
Requirements
8+ years designing high-performance storage systems, Expertise in block and object storage, Experience with parallel file systems, Linux systems engineering, Automation and scripting skills, Familiarity with BMC and Redfish APIs
Skills
CephLinuxNVMeS3
About the role
Responsibilities
- Define, architect, and implement Hydra Host’s first production storage platform tailored for bare-metal GPU clusters and AI/HPC workloads.
- Lead technical decisions regarding storage stack design, from hardware infrastructure to parallel file system orchestration and performance tuning.
- Select and maintain storage solutions spanning block (NVMe, SAN, Ceph) and object storage (S3-compatible) layers.
- Design for high-throughput, low-latency access to support large datasets and rapid checkpointing for distributed AI training.
- Integrate and optimize parallel file systems such as Lustre, BeeGFS, Spectrum Scale, or WekaIO.
- Develop automation, observability, and management tooling to ensure reliability and scalability.
- Collaborate cross-functionally with GPU, HPC, and platform engineering teams to integrate storage with compute and network layers.
Requirements
- 8+ years of hands-on experience designing and implementing high-performance storage systems for HPC, AI, or bare-metal cloud environments.
- Proven track record of building storage infrastructure from scratch.
- Deep expertise in block storage (NVMe, SAN, Ceph) and object storage (S3, MinIO, Ceph Object Gateway).
- Strong background in parallel file systems (WekaIO, BeeGFS, Lustre, or Spectrum Scale).
- Solid foundation in Linux systems engineering, automation, and scripting for distributed environments.
- Familiarity with BMC, Redfish APIs, and OEM server firmware for bare-metal management.
- Deep understanding of AI/ML data pipelines, including model checkpointing and data locality.
Preferred Qualifications
- Experience building storage solutions specifically for large-scale GPU or HPC infrastructure.
- History of technical leadership, mentorship, or owning a product roadmap.
- Experience managing vendor relationships and negotiating hardware/software contracts.
- Contributions to open-source HPC or storage projects (e.g., Ceph, Lustre, BeeGFS).
About the Company
Hydra Host is a Founders Fund-backed NVIDIA cloud partner building the infrastructure platform that powers AI at scale. Through our Brokkr platform, we connect high-performance GPU data centers with research labs, enterprises, and developer platforms, enabling scalable, high-performance access to next-generation NVIDIA compute.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSenior Storage Engineer
Hydra Host · Miami
