VLM-3R: Vision-Language Models Augmented with Instruction-Aligned 3D Reconstruction
Zhiwen Fan, Jian Zhang, Renjie Li, Junge Zhang,Runjin Chen,Hezhen Hu,Kevin Wang, Huaizhi Qu,Dilin Wang,Zhicheng Yan, Hongyu Xu, Justin Theiss,Tianlong Chen, Jiachen Li,Zhengzhong Tu, Zhangyang Wang,Rakesh Ranjan arxiv(2025)
AI 理解论文
溯源树
样例
