Unifying the Video and Question Attentions for Open-Ended Video Question Answering.

IEEE Transactions on Image Processing(2017)

引用 57|浏览45
暂无评分
摘要
Video question answering is an important task toward scene understanding and visual data retrieval. However, current visual question answering works mainly focus on a single static image, which is distinct from the dynamic and sequential visual data in the real world. Their approaches cannot utilize the temporal information in videos. In this paper, we introduce the task of free-form open-ended vi...
更多
查看译文
关键词
Knowledge discovery,Visualization,Adaptation models,Natural languages,Motion pictures,Coherence,Hair
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要