Gated Multi-modal Fusion with Cross-modal Contrastive Learning for Video Question Answering.
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT VII(2023)
关键词
Video Question Answering,Gated Multi-Modal Fusion,Cross-Modal Contrastive Learning
AI 理解论文
溯源树
样例

生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要