
Posted 21 hours ago
Software Engineer, ML Systems & Training Architecture
OpenAISoftware Engineer, ML Systems & Training Architecture
Requirements
Strong software engineering fundamentals, Experience with ML systems or training frameworks, Experience with GPUs or distributed systems, Ability to debug complex technical environments
Skills
Machine LearningDistributed Systems
About the role
Responsibilities
- Review, improve, and clean up code across training frameworks and adjacent infrastructure
- Identify risky or low-quality changes to raise the code quality bar without slowing the team down
- Debug complex issues across ML training systems, GPUs, clusters, networking, and related infrastructure
- Unblock researchers and engineers by fixing broken training jobs, flaky workflows, and brittle internal tooling
- Improve the reliability, maintainability, and usability of the robotics team’s training framework
- Move quickly on practical engineering problems that directly affect team velocity
Requirements
- Strong software engineering fundamentals and excellent code review judgment
- Experience with ML systems, training frameworks, GPUs, or distributed systems
- Proven ability to read and debug unfamiliar codebases and complex technical environments quickly
- Experience shipping high-quality code with strong velocity and pragmatic judgment
- Ability to work onsite in San Francisco 5 days per week
Preferred Qualifications
- Experience reviewing messy, fast-moving, or AI-generated codebases
- A preference for being a highly effective hands-on individual contributor over driving process-heavy initiatives
Benefits
- Competitive salary ($295K – $380K) and generous equity
- Medical, dental, and vision insurance for you and your family
- 401(k) retirement plan with employer match
- Paid parental leave and flexible PTO
- Daily meals in the office and meal delivery credits
- Annual learning and development stipend
- Relocation support for eligible employees
About the Company
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeSoftware Engineer, ML Systems & Training Architecture
OpenAI · San Francisco
