SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience

Elliot Chane-Sane; Joseph Amigo; Thomas Flayols; Ludovic Righetti; Nicolas Mansard

doi:10.48550/arXiv.2409.13678

Communication Dans Un Congrès Année : 2024

SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience

(1) , (1, 2) , (1) , (2) , (1)

1
2

Elliot Chane-Sane

Fonction : Auteur

Équipe Mouvement des Systèmes Anthropomorphes

Joseph Amigo

Fonction : Auteur

Équipe Mouvement des Systèmes Anthropomorphes

New York University [New York]

Thomas Flayols

Fonction : Auteur
PersonId : 740785
IdHAL : thomas-flayols
ORCID : 0000-0001-8078-2206
IdRef : 233626999

Équipe Mouvement des Systèmes Anthropomorphes

Ludovic Righetti

Fonction : Auteur

New York University [New York]

Nicolas Mansard

Fonction : Auteur
PersonId : 13958
IdHAL : nicolas-mansard
IdRef : 111691591

Équipe Mouvement des Systèmes Anthropomorphes

Résumé

Parkour poses a significant challenge for legged robots, requiring navigation through complex environments with agility and precision based on limited sensory inputs. In this work, we introduce a novel method for training end-to-end visual policies, from depth pixels to robot control commands, to achieve agile and safe quadruped locomotion. We formulate robot parkour as a constrained reinforcement learning (RL) problem designed to maximize the emergence of agile skills within the robot's physical limits while ensuring safety. We first train a policy without vision using privileged information about the robot's surroundings. We then generate experience from this privileged policy to warm-start a sample efficient off-policy RL algorithm from depth images. This allows the robot to adapt behaviors from this privileged experience to visual locomotion while circumventing the high computational costs of RL directly from pixels. We demonstrate the effectiveness of our method on a real Solo-12 robot, showcasing its capability to perform a variety of parkour skills such as walking, climbing, leaping, and crawling.

Mots clés

Reinforcement Learning Agile Locomotion Visuomotor Control Robotics (cs.RO) Computer Vision and Pattern Recognition (cs.CV) Machine Learning (cs.LG) FOS: Computer and information sciences

Domaines

Robotique [cs.RO]

Fichier principal

2409.13678v1.pdf (3.55 Mo)

Origine	Fichiers produits par l'(les) auteur(s)

Nicolas Mansard : Connectez-vous pour contacter le contributeur

https://hal.science/hal-04730062

Soumis le : jeudi 10 octobre 2024-11:44:02

Dernière modification le : lundi 14 octobre 2024-11:24:05

Dates et versions

hal-04730062 , version 1 (10-10-2024)

Identifiants

HAL Id : hal-04730062 , version 1
ARXIV : 2409.13678
DOI : 10.48550/arXiv.2409.13678

Citer

Elliot Chane-Sane, Joseph Amigo, Thomas Flayols, Ludovic Righetti, Nicolas Mansard. SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience. Conference on Robot Learning, Nov 2024, Munich (Allemagne), Germany. ⟨10.48550/arXiv.2409.13678⟩. ⟨hal-04730062⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UNIV-TLSE2 CNRS INSA-TOULOUSE LAAS LAAS-GEPETTO UT1-CAPITOLE LAAS-ROBOTIQUE GENCI INSA-GROUPE ANR ANITI TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP PEPR_O2R

120 Consultations

15 Téléchargements

SoloParkour: Constrained Reinforcement Learning for Visual Locomotion from Privileged Experience

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager