VP-GO: A ‘Light’ Action-Conditioned Visual Prediction Model for Grasping Objects

2022 International Conference on Advanced Robotics and Mechatronics (ICARM)

Abstract
Visual prediction models are promising solutions for vision-based robotic grasping of cluttered, unknown soft objects. Previous models from the literature are computationally greedy, which limits reproducibility; although some account for stochasticity in the prediction model, it is often too weak to capture the reality of robotics experiments involving grasping such objects. Furthermore, previous work focused on elementary movements, which makes it inefficient to reason in terms of more complex semantic actions. To address these limitations, we propose VP-GO, a "light" stochastic action-conditioned visual prediction model. We propose a hierarchical decomposition of semantic grasping and manipulation actions into elementary end-effector movements, which ensures compatibility with existing models and datasets for visual prediction of robotic actions, such as RoboNet. We also record and release a new open dataset for visual prediction of object grasping, called PandaGrasp. Our model can be pre-trained on RoboNet and fine-tuned on PandaGrasp, and it performs on par with more complex models in terms of signal prediction metrics. Qualitatively, it outperforms them when predicting the outcome of complex grasps performed by our robot.
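To give an intuition for the hierarchical decomposition mentioned in the abstract, the sketch below shows how a semantic "pick" command could be mapped to a short sequence of elementary end-effector movements in the spirit of RoboNet-style action vectors (dx, dy, dz, gripper). This is an illustrative assumption, not the authors' implementation; the function name, offsets, and gripper convention are hypothetical.

```python
# Illustrative sketch (not the paper's code): decomposing a semantic "pick"
# action into elementary end-effector movements. All offsets are hypothetical.
from typing import List, Tuple

Action = Tuple[float, float, float, float]  # (dx, dy, dz, gripper command)


def decompose_pick(ee_pos: Tuple[float, float, float],
                   obj_pos: Tuple[float, float, float],
                   hover: float = 0.15) -> List[Action]:
    """Return elementary (dx, dy, dz, gripper) steps for picking an object."""
    ex, ey, ez = ee_pos
    ox, oy, oz = obj_pos
    return [
        (ox - ex, oy - ey, (oz + hover) - ez, 1.0),  # move above object, gripper open
        (0.0, 0.0, -hover, 1.0),                     # descend onto the object
        (0.0, 0.0, 0.0, -1.0),                       # close the gripper
        (0.0, 0.0, hover, -1.0),                     # lift the object back up
    ]


if __name__ == "__main__":
    for step in decompose_pick(ee_pos=(0.4, 0.0, 0.3), obj_pos=(0.3, -0.1, 0.02)):
        print(step)
```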
Keywords
grasping objects,visual prediction model,action-conditioned