OCTOPUS: Open-vocabulary Content Tracking and Object Placement Using Semantic Understanding in Mixed Reality

Luke Yoffe, Aditya Sharma,Tobias Höllerer

2023 IEEE International Symposium on Mixed and Augmented Reality Adjunct (ISMAR-Adjunct)(2023)

引用 0|浏览3
暂无评分
摘要
One key challenge in augmented reality is the placement of virtual content in natural locations. Existing automated techniques are only able to work with a closed-vocabulary, fixed set of objects. In this paper, we introduce a new open-vocabulary method for object placement. Our eight-stage pipeline leverages recent advances in segmentation models, vision-language models, and LLMs to place any virtual object in any AR camera frame or scene. In a preliminary user study, we show that our method performs at least as well as human experts 57% of the time. 1
更多
查看译文
关键词
Semantic Content Placement,Augmented Reality,Vision and Language,Large Language Models
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要