Software Engineer, System Enablement at OpenAI - ScoutJobs - The AI-curated global job board
Skip to content
OpenAI
Posted a day ago

Software Engineer, System Enablement

OpenAI

Requirements

BS in CS/EE or equivalent, 5+ years systems SW development, Linux-based infrastructure experience, Kubernetes cluster operations, Infrastructure-as-Code (Terraform, Chef, Ansible), Provisioning and imaging (PXE, iPXE, cloud-init), Networking fundamentals (L2/L3, DNS, routing), Automation in Python, Go, or Bash

Skills

KubernetesTerraformPythonGoLinuxBash

About the role

Responsibilities

  • Own the end-to-end bring-up and bootstrap path for new systems and compute nodes from bare metal to schedulable fleet capacity.
  • Build and maintain golden image and provisioning workflows across lab and production environments.
  • Integrate nodes into fleet infrastructure and IaC pipelines using tools like Terraform and Chef.
  • Partner with scheduling and platform owners to ensure hardware reachability, network connectivity, and routing.
  • Drive registration and inventory correctness to ensure systems are visible and tracked end-to-end.
  • Collaborate on implementing baseline health and telemetry signals for automated reporting.
  • Debug complex issues across layers including PXE, UEFI/BIOS, BMC, OS, and network reachability.

Requirements

  • BS in Computer Science, Electrical Engineering, or equivalent practical experience.
  • 5+ years of experience in systems software development and operating Linux-based infrastructure.
  • Hands-on experience with Kubernetes cluster operations and node lifecycle management.
  • Proficiency with Infrastructure-as-Code and configuration management (Terraform, Chef, or Ansible).
  • Experience with provisioning and imaging technologies such as PXE, iPXE, and cloud-init.
  • Strong understanding of networking fundamentals including L2/L3, routing, DNS, and firewalling.
  • Ability to write automation in Python, Go, or Bash to ship operational tooling.

Preferred Qualifications

  • Experience bringing up new hardware platforms (early silicon, servers, or NICs) in a lab setting.
  • Multi-cloud operational experience with Azure, GCP, AWS, or OCI.
  • Experience building telemetry and health pipelines using agent-based metrics and logging.
  • Familiarity with WAN, peering, and multi-site network concepts for cluster deployments.

Benefits

  • Competitive salary and generous equity packages.
  • Comprehensive medical, dental, and vision insurance.
  • 401(k) retirement plan with employer match.
  • Paid parental leave and flexible PTO.
  • Daily meals in the office and mental health support.
  • Annual learning and development stipend.

About the Company

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of AI capabilities and seek to safely deploy them to the world through our products.

ScoutJobs Agent

Get matches like this delivered daily

Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.

Get started — it's free

Software Engineer, System Enablement

OpenAI · San Francisco

Sign up to apply