
Posted 21 hours ago
Researcher, Computer Use - Agent Post-Training
OpenAIResearcher, Computer Use - Agent Post-Training
Requirements
Machine learning fundamentals, Software engineering, Systems engineering, Statistics, LLM experience, RL/RLHF experience, Post-training experience, Synthetic data generation
Skills
Machine LearningLLM
About the role
Responsibilities
- Design and run experiments to improve agentic model behavior for complex computer use, including desktop and browser navigation.
- Own end-to-end improvements to the post-training stack, including RL, data pipelines, graders, reward signals, and evals.
- Build environments and evaluations that expose model failures and translate them into training data or new research directions.
- Partner with product teams to translate user needs into model improvements for Codex and ChatGPT.
- Work on early-training and alignment interventions, including data mixtures, synthetic data, and objective functions.
- Improve large-scale training machinery, focusing on experiment velocity, reliability, and production readiness.
Requirements
- Strong technical fundamentals in machine learning, software engineering, systems, or statistics.
- Hands-on experience with LLMs, RL, RLHF/RLAIF, or post-training methodologies.
- Experience with synthetic data generation, evals, graders, or model training.
- Proven ability to move from vague behavioral problems to concrete, actionable experiments.
- Experience with coding agents, tool-using agents, or production-scale ML systems.
Preferred Qualifications
- Experience working across research, product, and infrastructure boundaries.
- Deep interest in the intersection of frontier model training and product behavior.
- Ability to debug complex qualitative model behaviors and turn them into quantitative hypotheses.
Benefits
- Competitive salary ($250K – $380K) and generous equity.
- Comprehensive medical, dental, and vision insurance.
- 401(k) retirement plan with employer match.
- Paid parental leave and flexible PTO.
- Daily meals in the office and meal delivery credits.
- Annual learning and development stipend.
About the Company
OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We push the boundaries of AI capabilities and seek to safely deploy them to the world through our products.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeResearcher, Computer Use - Agent Post-Training
OpenAI · San Francisco
