LongVU: Spatiotemporal Adaptive Compression for Long Video-Language Understanding
Xiaoqian Shen,Yunyang Xiong,Changsheng Zhao,Lemeng Wu,Jun Chen,Chenchen Zhu,Zechun Liu,Fanyi Xiao,Balakrishnan Varadarajan,Florian Bordes,Zhuang Liu,Hu Xu,Hyunwoo J. Kim,Bilge Soran,Raghuraman Krishnamoorthi,Mohamed Elhoseiny,Vikas Chandra CoRR(2024)
AI 理解论文
溯源树
样例
