Revisiting Image-Language Networks for Open-Ended Phrase Detection
IEEE Transactions on Pattern Analysis and Machine Intelligence(2022)
摘要
Most existing work that grounds natural language phrases in images starts with the assumption that the phrase in question is relevant to the image. In this paper we address a more realistic version of the natural language grounding task where we must both identify whether the phrase is relevant to an image and localize the phrase. This can also be viewed as a generalization of object ...
更多查看译文
关键词
Task analysis,Grounding,Visualization,Feature extraction,Benchmark testing,Detectors,Vocabulary
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络