Unknown

Dataset Information

0

Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking.


ABSTRACT: A control system for bipedal walking in the sagittal plane was developed in simulation. The biped model was built based on anthropometric data for a 1.8 m tall male of average build. At the core of the controller is a deep deterministic policy gradient (DDPG) neural network that was trained in GAZEBO, a physics simulator, to predict the ideal foot placement to maintain stable walking despite external disturbances. The complexity of the DDPG network was decreased through carefully selected state variables and a distributed control system. Additional controllers for the hip joints during their stance phases and the ankle joint during toe-off phase help to stabilize the biped during walking. The simulated biped can walk at a steady pace of approximately 1 m/s, and during locomotion it can maintain stability with a 30 kg·m/s impulse applied forward on the torso or a 40 kg·m/s impulse applied rearward. It also maintains stable walking with a 10 kg backpack or a 25 kg front pack. The controller was trained on a 1.8 m tall model, but also stabilizes models 1.4-2.3 m tall with no changes.

SUBMITTER: Liu C 

PROVIDER: S-EPMC6477666 | biostudies-literature | 2019 Mar

REPOSITORIES: biostudies-literature

altmetric image

Publications

Implementation of Deep Deterministic Policy Gradients for Controlling Dynamic Bipedal Walking.

Liu Chujun C   Lonsberry Andrew G AG   Nandor Mark J MJ   Audu Musa L ML   Lonsberry Alexander J AJ   Quinn Roger D RD  

Biomimetics (Basel, Switzerland) 20190322 1


A control system for bipedal walking in the sagittal plane was developed in simulation. The biped model was built based on anthropometric data for a 1.8 m tall male of average build. At the core of the controller is a deep deterministic policy gradient (DDPG) neural network that was trained in GAZEBO, a physics simulator, to predict the ideal foot placement to maintain stable walking despite external disturbances. The complexity of the DDPG network was decreased through carefully selected state  ...[more]

Similar Datasets

| S-EPMC4201655 | biostudies-literature
| S-EPMC2696144 | biostudies-literature
| S-EPMC6699981 | biostudies-literature
| S-EPMC6002472 | biostudies-literature
| S-EPMC8149599 | biostudies-literature
| S-EPMC8228592 | biostudies-literature
| S-EPMC6302870 | biostudies-other
| S-EPMC3673057 | biostudies-literature
| S-EPMC4279450 | biostudies-literature
| S-EPMC4600717 | biostudies-literature