Symmetry-aware Reinforcement Learning for Robotic Assembly under Partial Observability with a Soft Wrist
CoRR(2024)
摘要
This study tackles the representative yet challenging contact-rich
peg-in-hole task of robotic assembly, using a soft wrist that can operate more
safely and tolerate lower-frequency control signals than a rigid one. Previous
studies often use a fully observable formulation, requiring external setups or
estimators for the peg-to-hole pose. In contrast, we use a partially observable
formulation and deep reinforcement learning from demonstrations to learn a
memory-based agent that acts purely on haptic and proprioceptive signals.
Moreover, previous works do not incorporate potential domain symmetry and thus
must search for solutions in a bigger space. Instead, we propose to leverage
the symmetry for sample efficiency by augmenting the training data and
constructing auxiliary losses to force the agent to adhere to the symmetry.
Results in simulation with five different symmetric peg shapes show that our
proposed agent can be comparable to or even outperform a state-based agent. In
particular, the sample efficiency also allows us to learn directly on the real
robot within 3 hours.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要