
Posted a day ago
Software Engineer, Data Infrastructure - Research
OpenAI
Requirements
Distributed systems experience, Data pipeline engineering, Infrastructure design, API development, Scalable abstractions, Debugging large-scale machine fleets
Skills
Distributed Systems, Data Pipelines, Infrastructure
About the role
Responsibilities
- Design and maintain standardized dataset APIs, including for multimodal data that exceeds memory capacity
- Build proactive testing and scale validation pipelines for dataset loading at GPU scale
- Collaborate with researchers to integrate datasets seamlessly into training and inference pipelines
- Document dataset interfaces to ensure they are discoverable, consistent, and easy to adopt
- Establish safeguards and validation systems to ensure dataset reproducibility
- Debug and resolve performance bottlenecks in distributed dataset loading across large machine fleets
- Provide visualization and inspection tools to surface errors or bottlenecks in datasets
Requirements
- Strong engineering fundamentals with experience in distributed systems, data pipelines, or infrastructure
- Experience building APIs, modular code, and scalable abstractions
- Proven ability to debug performance bottlenecks across large fleets of machines
- Experience with infrastructure design and data pipeline engineering
Preferred Qualifications
- Background knowledge in data math, probability, or distributed data theory
- Experience working with GPU-scale distributed systems or dataset scaling for real-time data
Benefits
- Competitive salary ($250K – $380K) and generous equity
- Medical, dental, and vision insurance for you and your family
- 401(k) retirement plan with employer match
- Paid parental leave and flexible PTO
- Daily meals in the office and meal delivery credits
- Annual learning and development stipend
- Relocation support for eligible employees
About the Company
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products.
OpenAI · San Francisco
