Focal Visual-Text Attention for Memex Question AnsweringEIWOS
IEEE Transactions on Pattern Analysis and Machine Intelligence, pp. 1-1, 2019.
Keywords:Task analysisKnowledge discoveryVisualizationGroundingMetadataview more (1+)
Recent insights on language and vision with neural networks have been successfully applied to simple single-image visual question answering. However, to tackle real-life question answering problems on multimedia collections such as personal photo albums, we have to look at whole collections with sequences of photos. This paper proposes a ...More