Quadruped Reinforcement Learning without Explicit State Estimation.

ROBIO(2022)

引用 0|浏览7
暂无评分
摘要
Reinforcement learning is a promising approach to developing legged robot locomotion controllers. The gen-eral process of development is: large-scale training in the virtual simulation environment to obtain reliable control policy network, and then the policy network is deployed to real legged robot. In the training procedure, a complete robot state increases the speed of training and the stability of policy. The robot's states like body velocities are easily available in simulation training, but they are difficult to obtain in the real robot, hence specifically designed robot state estimators are needed. However, the development of state estimators requires expert knowledge related to control theory and robotics, limiting the direct application of reinforcement learning to robots. To take advantage of the end-to-end mapping of artificial neural networks, we simplified the existing reinforcement learning process for quadruped robots and propose a training method based on curriculum learning in this work. The proposed method can produce a reliable policy that does not require robot state estimator and only take raw sensors data. The feasibility of the proposed method is verified in simulations and real quadrupedal robot. Video of the quadrupedal robot is available at www.youtube.com/watch?v=-iho4KIlEPw.
更多
查看译文
关键词
artificial neural networks,curriculum learning,end-to-end mapping,expert knowledge,explicit state estimation,legged robot locomotion controllers,policy stability,quadruped reinforcement learning,quadrupedal robot,robot state estimator design,virtual simulation,virtual simulation environment
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要