Visual relation of interest detection based on part detection

INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE AND ROBOTICS 2021(2021)

引用 1|浏览2
暂无评分
摘要
Visual relation detection (VRD) aims to describe images with relation triplets like , paying attention to the interaction between every two instances. To detect the visual relations that express the main content of a given image, visual relation of interest detection (VROID) is proposed as an extension of the traditional VRD task. The existing methods related to the general VRD task are mostly based on instance-level features and the methods that adopt detailed information only use part-level attention or human body parts. None of the existing methods take advantage of general semantic parts. Therefore, on the basis of the IPNet for VROID, we further propose an interest propagation form part (IPFP) method which propagates interest along "part-instance-pair-triplet" to detect visual relations of interest. The IPFP method consists of four modules. Panoptic Object-Part Detection module, which extracts instances with instance features and instance parts with part features, Part Interest Prediction module. which predicts interest for every single part, Instance Interest Prediction module, which predicts interest for every single instance; the PairiP module predicts interest for each pair of instances; and the PredIP module predicts possible predicates for each instance pairs, Pair Interest Prediction module. which predicts interest for each pair of instances, and Predicate Interest Prediction module. which predicts possible predicates for each instance pairs. The interest scores of visual relations are the product of pair interest scores and predicate possibilities for pairs. We evaluate the performance of the IPFP method and the effectiveness of important components using the ViROI dataset for VROID.
更多
查看译文
关键词
Visual relation of interest detection, interest propagation network, interest propagation from part
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要