What do we learn from a large-scale study of pre-trained visual representations in sim and real environments?

Sneha Silwal,Karmesh Yadav, Tingfan Wu,Jay Vakil,Arjun Majumdar, Sergio Arnaud,Claire Chen,Vincent-Pierre Berges,Dhruv Batra,Aravind Rajeswaran,Mrinal Kalakrishnan,Franziska Meier,Oleksandr Maksymets

CoRR（2023）

引用 0|浏览43

暂无评分

摘要

We present a large empirical investigation on the use of pre-trained visual representations (PVRs) for training downstream policies that execute real-world tasks. Our study spans five different PVRs, two different policy-learning paradigms (imitation and reinforcement learning), and three different robots for 5 distinct manipulation and indoor navigation tasks. From this effort, we can arrive at three insights: 1) the performance trends of PVRs in the simulation are generally indicative of their trends in the real world, 2) the use of PVRs enables a first-of-its-kind result with indoor ImageNav (zero-shot transfer to a held-out scene in the real world), and 3) the benefits from variations in PVRs, primarily data-augmentation and fine-tuning, also transfer to the real-world performance. See project website for additional details and visuals.

查看译文

关键词

representations,real environments,large-scale,pre-trained

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要