Commonsense-Aware Object Value Graph for Object Goal Navigation

IEEE Robotics and Automation Letters(2024)

引用 0|浏览0
暂无评分
摘要
Object goal navigation (ObjectNav) is the task of finding a target object in an unseen environment. It is one of the fundamental challenges in visual navigation as it requires both structural and semantic understanding. In this paper, we present OVG-Nav, a novel ObjectNav framework that leverages a topological graph structure called object value graph (OVG), which contains visual observations and commonsense prior knowledge. The high-level planning of OVG-Nav prioritizes subgoal nodes for exploration based on a metric called object value , which reflects the closeness to the target object. Here, we propose OVGNet, a model designed to predict the object values of each node of an OVG using observed features along with commonsense knowledge. The structure of high-level planning using OVG and low-level action decisions reduces sensitivity to accumulating sensor noises, leading to robust navigation performance. Experimental results show that OVG-Nav outperforms the baseline in success rate (SR) and success rate weighted by path length (SPL) in the MP3D dataset both in accurate sensing and noisy sensing. In addition, we show that the OVG-Nav can be transferred to the real-world robot successfully.
更多
查看译文
关键词
Vision-Based Navigation,Semantic Scene Understanding,Autonomous Agents
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要