Unknown

Dataset Information

0

Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics.


ABSTRACT: Locomotion control has long been vital to legged robots. Agile locomotion can be implemented through either model-based controller or reinforcement learning. It is proven that robust controllers can be obtained through model-based methods and learning-based policies have advantages in generalization. This paper proposed a hybrid framework of locomotion controller that combines deep reinforcement learning and simple heuristic policy and assigns them to different activation phases, which provides guidance for adaptive training without producing conflicts between heuristic knowledge and learned policies. The training in simulation follows a step-by-step stochastic curriculum to guarantee success. Domain randomization during training and assistive extra feedback loops on real robot are also adopted to smooth the transition to the real world. Comparison experiments are carried out on both simulated and real Wukong-IV humanoid robots, and the proposed hybrid approach matches the canonical end-to-end approaches with higher rate of success, faster converging speed, and 60% less tracking error in velocity tracking tasks.

SUBMITTER: Wang Z 

PROVIDER: S-EPMC9611364 | biostudies-literature | 2022 Oct

REPOSITORIES: biostudies-literature

altmetric image

Publications

Hybrid Bipedal Locomotion Based on Reinforcement Learning and Heuristics.

Wang Zhicheng Z   Wei Wandi W   Xie Anhuan A   Zhang Yifeng Y   Wu Jun J   Zhu Qiuguo Q  

Micromachines 20221007 10


Locomotion control has long been vital to legged robots. Agile locomotion can be implemented through either model-based controller or reinforcement learning. It is proven that robust controllers can be obtained through model-based methods and learning-based policies have advantages in generalization. This paper proposed a hybrid framework of locomotion controller that combines deep reinforcement learning and simple heuristic policy and assigns them to different activation phases, which provides  ...[more]

Similar Datasets

| S-EPMC9899902 | biostudies-literature
| S-EPMC2849068 | biostudies-literature
| S-EPMC11564515 | biostudies-literature
| S-EPMC10426021 | biostudies-literature
| S-EPMC6892559 | biostudies-literature
| S-EPMC10092184 | biostudies-literature
| S-EPMC10452096 | biostudies-literature
| S-EPMC9484268 | biostudies-literature
| S-EPMC7224393 | biostudies-literature
| S-EPMC6002472 | biostudies-literature