Emergence of natural and robust bipedal walking by learning from biologically plausible objectives

Humans show unparalleled ability when maneuvering diverse terrains. While reinforcement learning (RL) has shown great promise for musculoskeletal simulation in the development of robust controllers, complex behaviors are only achievable under extensive use of motion data. We demonstrate that the com...

Detailed description

Bibliographic details
Main authors: Schumacher, Pierre (Author), Geijtenbeek, Thomas (Author), Caggiano, Vittorio (Author), Kumar, Vikash (Author), Schmitt, Syn (Author), Martius, Georg (Author), Häufle, Daniel F. B. (Author)
Document type: Article (Journal)
Language: English
Published: 18 April 2025
In: iScience
Year: 2025, Volume: 28, Issue: 4, Pages: 1-12, e1-e4
ISSN: 2589-0042
DOI: 10.1016/j.isci.2025.112203
Online access: Publisher, free of charge, full text: https://doi.org/10.1016/j.isci.2025.112203
Publisher, free of charge, full text: https://www.sciencedirect.com/science/article/pii/S258900422500464X
Statement of responsibility: Pierre Schumacher, Thomas Geijtenbeek, Vittorio Caggiano, Vikash Kumar, Syn Schmitt, Georg Martius, and Daniel F.B. Haeufle

MARC

LEADER 00000naa a2200000 c 4500
001 1937969649
003 DE-627
005 20251008104246.0
007 cr uuu---uuuuu
008 251008s2025 xx |||||o 00| ||eng c
024 7 |a 10.1016/j.isci.2025.112203  |2 doi 
035 |a (DE-627)1937969649 
035 |a (DE-599)KXP1937969649 
040 |a DE-627  |b ger  |c DE-627  |e rda 
041 |a eng 
084 |a 33  |2 sdnb 
100 1 |a Schumacher, Pierre  |e VerfasserIn  |0 (DE-588)1362703397  |0 (DE-627)1921829648  |4 aut 
245 1 0 |a Emergence of natural and robust bipedal walking by learning from biologically plausible objectives  |c Pierre Schumacher, Thomas Geijtenbeek, Vittorio Caggiano, Vikash Kumar, Syn Schmitt, Georg Martius, and Daniel F.B. Haeufle 
264 1 |c 18 April 2025 
300 |b Illustrationen 
300 |a 16 
336 |a Text  |b txt  |2 rdacontent 
337 |a Computermedien  |b c  |2 rdamedia 
338 |a Online-Ressource  |b cr  |2 rdacarrier 
500 |a Online verfügbar 11 March 2025, Version des Artikels 5 April 2025 
500 |a Gesehen am 08.10.2025 
520 |a Humans show unparalleled ability when maneuvering diverse terrains. While reinforcement learning (RL) has shown great promise for musculoskeletal simulation in the development of robust controllers, complex behaviors are only achievable under extensive use of motion data. We demonstrate that the combination of a recent RL algorithm with a biologically plausible reward is capable of learning controllers for 4 different musculoskeletal models and achieves locomotion with up to 90 muscles without demonstrations. Our controllers generalize to diverse and unseen terrains, while only a single adaptive objective function is needed for training. We validate our findings on four models in two different simulators. The RL agents perform robustly with complex 3D models, where reflex-controllers are difficult to apply, and produce close-to-natural motion. This is a first step for the motor control, biomechanics, and rehabilitation communities to generate complex human movements with RL, without using motion data or simple unrepresentative models. 
650 4 |a Behavioral neuroscience 
650 4 |a Biological sciences 
650 4 |a Neuroscience 
700 1 |a Geijtenbeek, Thomas  |e VerfasserIn  |4 aut 
700 1 |a Caggiano, Vittorio  |e VerfasserIn  |4 aut 
700 1 |a Kumar, Vikash  |e VerfasserIn  |4 aut 
700 1 |a Schmitt, Syn  |e VerfasserIn  |4 aut 
700 1 |a Martius, Georg  |e VerfasserIn  |4 aut 
700 1 |a Häufle, Daniel F. B.  |e VerfasserIn  |0 (DE-588)1029742979  |0 (DE-627)73399668X  |0 (DE-576)377576417  |4 aut 
773 0 8 |i Enthalten in  |t iScience  |d Amsterdam : Elsevier, 2018  |g 28(2025), 4, Artikel-ID 112203, Seite 1-12,e1-e4  |h Online-Ressource  |w (DE-627)1019532106  |w (DE-600)2927064-9  |w (DE-576)502115858  |x 2589-0042  |7 nnas  |a Emergence of natural and robust bipedal walking by learning from biologically plausible objectives 
773 1 8 |g volume:28  |g year:2025  |g number:4  |g elocationid:112203  |g pages:1-12,e1-e4  |g extent:16  |a Emergence of natural and robust bipedal walking by learning from biologically plausible objectives 
856 4 0 |u https://doi.org/10.1016/j.isci.2025.112203  |x Verlag  |x Resolving-System  |z kostenfrei  |3 Volltext 
856 4 0 |u https://www.sciencedirect.com/science/article/pii/S258900422500464X  |x Verlag  |z kostenfrei  |3 Volltext 
951 |a AR 
992 |a 20251008 
993 |a Article 
994 |a 2025 
998 |g 1029742979  |a Häufle, Daniel F. B.  |m 1029742979:Häufle, Daniel F. B.  |p 7  |y j 
999 |a KXP-PPN1937969649  |e 4782599625 
BIB |a Y 
SER |a journal 
JSO |a {"relHost":[{"recId":"1019532106","pubHistory":["Volume 1 (March 23, 2018)-"],"note":["Gesehen am 11.09.2018"],"disp":"Emergence of natural and robust bipedal walking by learning from biologically plausible objectivesiScience","id":{"eki":["1019532106"],"zdb":["2927064-9"],"issn":["2589-0042"]},"title":[{"title_sort":"iScience","title":"iScience"}],"language":["eng"],"origin":[{"publisher":"Elsevier","publisherPlace":"Amsterdam ; Boston ; London ; New York ; Oxford ; Paris ; Philadelphia ; San Diego ; St. Louis","dateIssuedDisp":"[2018]-"}],"type":{"bibl":"periodical","media":"Online-Ressource"},"part":{"text":"28(2025), 4, Artikel-ID 112203, Seite 1-12,e1-e4","volume":"28","issue":"4","year":"2025","extent":"16","pages":"1-12,e1-e4"},"physDesc":[{"extent":"Online-Ressource"}]}],"person":[{"given":"Pierre","role":"aut","display":"Schumacher, Pierre","family":"Schumacher","roleDisplay":"VerfasserIn"},{"roleDisplay":"VerfasserIn","family":"Geijtenbeek","role":"aut","display":"Geijtenbeek, Thomas","given":"Thomas"},{"role":"aut","display":"Caggiano, Vittorio","given":"Vittorio","roleDisplay":"VerfasserIn","family":"Caggiano"},{"roleDisplay":"VerfasserIn","family":"Kumar","given":"Vikash","display":"Kumar, Vikash","role":"aut"},{"given":"Syn","role":"aut","display":"Schmitt, Syn","family":"Schmitt","roleDisplay":"VerfasserIn"},{"role":"aut","display":"Martius, Georg","given":"Georg","roleDisplay":"VerfasserIn","family":"Martius"},{"family":"Häufle","roleDisplay":"VerfasserIn","given":"Daniel F. B.","display":"Häufle, Daniel F. B.","role":"aut"}],"name":{"displayForm":["Pierre Schumacher, Thomas Geijtenbeek, Vittorio Caggiano, Vikash Kumar, Syn Schmitt, Georg Martius, and Daniel F.B. Haeufle"]},"physDesc":[{"extent":"16 S.","noteIll":"Illustrationen"}],"origin":[{"dateIssuedKey":"2025","dateIssuedDisp":"18 April 2025"}],"type":{"bibl":"article-journal","media":"Online-Ressource"},"language":["eng"],"note":["Online verfügbar 11 March 2025, Version des Artikels 5 April 2025","Gesehen am 08.10.2025"],"recId":"1937969649","id":{"doi":["10.1016/j.isci.2025.112203"],"eki":["1937969649"]},"title":[{"title_sort":"Emergence of natural and robust bipedal walking by learning from biologically plausible objectives","title":"Emergence of natural and robust bipedal walking by learning from biologically plausible objectives"}]} 
SRT |a SCHUMACHEREMERGENCEO1820