Recent, Rapid Advancement in Visual Question Answering: a Review

Venkat Kodali,Daniel Berleant

arxiv(2022)

引用 2|浏览2
暂无评分
摘要
Understanding visual question answering is going to be crucial for numerous human activities. However, it presents major challenges at the heart of the artificial intelligence endeavor. This paper presents an update on the rapid advancements in visual question answering using images that have occurred in the last couple of years. Tremendous growth in research on improving visual question answering system architecture has been published recently, showing the importance of multimodal architectures. Several points on the benefits of visual question answering are mentioned in the review paper as in [1], on which the present article builds, including subsequent updates in the field.
更多
查看译文
关键词
VQA,visual question answering,review,survey
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要