Requirements

Reinforcement learning algorithm development, RL fundamentals knowledge, Policy training in simulation/real-world, Python proficiency, PyTorch experience, Large-scale experimentation, Distributed training systems, Software engineering skills

Skills

Machine LearningPythonPyTorch

About the role

Responsibilities

Design and implement reinforcement learning algorithms for embodied agents in real-world and simulated environments
Train policies that learn from interaction, feedback, and large-scale experience across diverse tasks
Develop reward modeling, credit assignment, and exploration strategies for complex, long-horizon behaviors
Improve policy robustness to real-world challenges such as noise and partial observability
Work across online and offline RL settings, including learning from large-scale logged robot data
Collaborate with pretraining, video, generative, and robot learning teams to integrate RL into the autonomy stack
Build scalable training systems, including distributed rollouts and simulation infrastructure
Design evaluation frameworks to measure policy performance, stability, and generalization

Requirements

Experience developing and applying reinforcement learning algorithms in complex environments
Strong understanding of RL fundamentals such as policy optimization and value methods
Experience training policies in simulation and/or real-world systems
Proficiency in Python and deep learning frameworks like PyTorch
Experience with large-scale experimentation and distributed training systems
Strong experimental rigor and ability to diagnose learning systems
Solid software engineering skills to build scalable, reliable systems
Ability to operate independently and drive high-impact technical problems

Preferred Qualifications

Experience applying RL to robotics, control systems, or embodied AI
Background in offline RL, imitation learning, or hybrid learning approaches
Experience with reward modeling or human-in-the-loop learning
Experience at leading AI labs such as OpenAI, Google DeepMind, Anthropic, or xAI
Publication record in reinforcement learning, machine learning, or robotics

About the Company

Figure is an AI robotics company developing autonomous general-purpose humanoid robots. Our goal is to build embodied AI systems that can perceive, reason, and act in the real world. Figure is headquartered in San Jose, CA.

Helix AI Engineer, Reinforcement Learning

Requirements

Skills

About the role

Responsibilities

Requirements

Preferred Qualifications

About the Company

Get matches like this delivered daily

Helix AI Engineer, Reinforcement Learning