
Posted 4 days ago
Helix AI Engineer, Reinforcement Learning
FigureHelix AI Engineer, Reinforcement Learning
Requirements
Reinforcement learning algorithm development, RL fundamentals knowledge, Policy training in simulation/real-world, Python proficiency, PyTorch experience, Large-scale experimentation, Distributed training systems, Software engineering skills
Skills
Machine LearningPythonPyTorch
About the role
Responsibilities
- Design and implement reinforcement learning algorithms for embodied agents in real-world and simulated environments
- Train policies that learn from interaction, feedback, and large-scale experience across diverse tasks
- Develop reward modeling, credit assignment, and exploration strategies for complex, long-horizon behaviors
- Improve policy robustness to real-world challenges such as noise and partial observability
- Work across online and offline RL settings, including learning from large-scale logged robot data
- Collaborate with pretraining, video, generative, and robot learning teams to integrate RL into the autonomy stack
- Build scalable training systems, including distributed rollouts and simulation infrastructure
- Design evaluation frameworks to measure policy performance, stability, and generalization
Requirements
- Experience developing and applying reinforcement learning algorithms in complex environments
- Strong understanding of RL fundamentals such as policy optimization and value methods
- Experience training policies in simulation and/or real-world systems
- Proficiency in Python and deep learning frameworks like PyTorch
- Experience with large-scale experimentation and distributed training systems
- Strong experimental rigor and ability to diagnose learning systems
- Solid software engineering skills to build scalable, reliable systems
- Ability to operate independently and drive high-impact technical problems
Preferred Qualifications
- Experience applying RL to robotics, control systems, or embodied AI
- Background in offline RL, imitation learning, or hybrid learning approaches
- Experience with reward modeling or human-in-the-loop learning
- Experience at leading AI labs such as OpenAI, Google DeepMind, Anthropic, or xAI
- Publication record in reinforcement learning, machine learning, or robotics
About the Company
Figure is an AI robotics company developing autonomous general-purpose humanoid robots. Our goal is to build embodied AI systems that can perceive, reason, and act in the real world. Figure is headquartered in San Jose, CA.
ScoutJobs Agent
Get matches like this delivered daily
Sign up free — we'll pull jobs that fit your CV from across the web and rank them for you.
Get started — it's freeHelix AI Engineer, Reinforcement Learning
Figure · San Jose
