Requirements

Machine learning fundamentals, Software engineering, Systems engineering, Statistics, LLM experience, RL/RLHF experience, Post-training experience, Synthetic data generation

Skills

Machine Learning, LLM

About the role

Responsibilities

Design and run experiments to improve agentic model behavior for complex computer use, including desktop and browser navigation.
Own end-to-end improvements to the post-training stack, including RL, data pipelines, graders, reward signals, and evals.
Build environments and evaluations that expose model failures and translate them into training data or new research directions.
Partner with product teams to translate user needs into model improvements for Codex and ChatGPT.
Work on early-training and alignment interventions, including data mixtures, synthetic data, and objective functions.
Improve large-scale training machinery, focusing on experiment velocity, reliability, and production readiness.

Requirements

Strong technical fundamentals in machine learning, software engineering, systems, or statistics.
Hands-on experience with LLMs, RL, RLHF/RLAIF, or post-training methodologies.
Experience with synthetic data generation, evals, graders, or model training.
Proven ability to move from vague behavioral problems to concrete, actionable experiments.
Experience with coding agents, tool-using agents, or production-scale ML systems.

Preferred Qualifications

Experience working across research, product, and infrastructure boundaries.
Deep interest in the intersection of frontier model training and product behavior.
Ability to debug complex qualitative model behaviors and turn them into quantitative hypotheses.

Benefits

Competitive salary ($250K – $380K) and generous equity.
Comprehensive medical, dental, and vision insurance.
401(k) retirement plan with employer match.
Paid parental leave and flexible PTO.
Daily meals in the office and meal delivery credits.
Annual learning and development stipend.

About the Company

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of AI capabilities and seek to safely deploy them to the world through our products.

Researcher, Computer Use - Agent Post-Training
