VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech InteractionChaoyou Fu,Haojia Lin,Xiong Wang,Yi-Fan Zhang,Yunhang Shen,Xiaoyu Liu, Haoyu Cao, Zuwei Long, Heting Gao,Ke Li, Long Ma,Xiawu Zheng,Rongrong Ji,Xing Sun,Caifeng Shan,Ran HeCoRR(2025)引用 27|浏览321AI 理解论文溯源树样例生成溯源树,研究论文发展脉络Chat Paper正在生成论文摘要