Software Engineer, Data Infrastructure - Research at OpenAI - ScoutJobs - The AI-curated global job board
Skip to content
OpenAI
Posted a day ago

Software Engineer, Data Infrastructure - Research

OpenAI

Requirements

Distributed systems experience, Data pipeline engineering, Infrastructure design, API development, Scalable abstractions, Debugging large-scale machine fleets

Skills

Distributed SystemsData Pipelinesinfrastructure

About the role

Responsibilities

  • Design and maintain standardized dataset APIs, including for multimodal data that exceeds memory capacity
  • Build proactive testing and scale validation pipelines for dataset loading at GPU scale
  • Collaborate with researchers to integrate datasets seamlessly into training and inference pipelines
  • Document dataset interfaces to ensure they are discoverable, consistent, and easy to adopt
  • Establish safeguards and validation systems to ensure dataset reproducibility
  • Debug and resolve performance bottlenecks in distributed dataset loading across large machine fleets
  • Provide visualization and inspection tools to surface errors or bottlenecks in datasets

Requirements

  • Strong engineering fundamentals with experience in distributed systems, data pipelines, or infrastructure
  • Experience building APIs, modular code, and scalable abstractions
  • Proven ability to debug performance bottlenecks across large fleets of machines
  • Experience with infrastructure design and data pipeline engineering

Preferred Qualifications

  • Background knowledge in data math, probability, or distributed data theory
  • Experience working with GPU-scale distributed systems or dataset scaling for real-time data

Benefits

  • Competitive salary ($250K – $380K) and generous equity
  • Medical, dental, and vision insurance for you and your family
  • 401(k) retirement plan with employer match
  • Paid parental leave and flexible PTO
  • Daily meals in the office and meal delivery credits
  • Annual learning and development stipend
  • Relocation support for eligible employees

About the Company

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of the capabilities of AI systems and seek to safely deploy them to the world through our products.

ScoutJobs Agent

Get matches like this delivered daily

Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.

Get started — it's free

Software Engineer, Data Infrastructure - Research

OpenAI · San Francisco

Sign up to apply