Emergence of natural and robust bipedal walking by learning from biologically plausible objectives

Humans show unparalleled ability when maneuvering diverse terrains. While reinforcement learning (RL) has shown great promise for musculoskeletal simulation in the development of robust controllers, complex behaviors are only achievable under extensive use of motion data. We demonstrate that the com...

Full description

Saved in:
Bibliographic Details
Main Authors: Schumacher, Pierre (Author) , Geijtenbeek, Thomas (Author) , Caggiano, Vittorio (Author) , Kumar, Vikash (Author) , Schmitt, Syn (Author) , Martius, Georg (Author) , Häufle, Daniel F. B. (Author)
Format: Article (Journal)
Language:English
Published: 18 April 2025
In: iScience
Year: 2025, Volume: 28, Issue: 4, Pages: 1-12,e1-e4
ISSN:2589-0042
DOI:10.1016/j.isci.2025.112203
Online Access:Verlag, kostenfrei, Volltext: https://doi.org/10.1016/j.isci.2025.112203
Verlag, kostenfrei, Volltext: https://www.sciencedirect.com/science/article/pii/S258900422500464X
Get full text
Author Notes:Pierre Schumacher, Thomas Geijtenbeek, Vittorio Caggiano, Vikash Kumar, Syn Schmitt, Georg Martius, and Daniel F.B. Haeufle
Description
Summary:Humans show unparalleled ability when maneuvering diverse terrains. While reinforcement learning (RL) has shown great promise for musculoskeletal simulation in the development of robust controllers, complex behaviors are only achievable under extensive use of motion data. We demonstrate that the combination of a recent RL algorithm with a biologically plausible reward is capable of learning controllers for 4 different musculoskeletal models and achieves locomotion with up to 90 muscles without demonstrations. Our controllers generalize to diverse and unseen terrains, while only a single adaptive objective function is needed for training. We validate our findings on four models in two different simulators. The RL agents perform robustly with complex 3D models, where reflex-controllers are difficult to apply, and produce close-to-natural motion. This is a first step for the motor control, biomechanics, and rehabilitation communities to generate complex human movements with RL, without using motion data or simple unrepresentative models.
Item Description:Online verfügbar 11 March 2025, Version des Artikels 5 April 2025
Gesehen am 08.10.2025
Physical Description:Online Resource
ISSN:2589-0042
DOI:10.1016/j.isci.2025.112203