Hierarchical Graph Attention Network for Visual Relationship Detection

CVPR（2020）

引用 76|浏览144

暂无评分

摘要

Visual Relationship Detection (VRD) aims to describe the relationship between two objects by providing a structural triplet shown as . Existing graph-based methods mainly represent the relationships by an object-level graph, which ignores to model the triplet-level dependencies. In this work, a Hierarchical Graph Attention Network (HGAT) is proposed to capture the dependencies on both object-level and triplet-level. Object-level graph aims to capture the interactions between objects, while the triplet-level graph models the dependencies among relation triplets. In addition, prior knowledge and attention mechanism are introduced to fix the redundant or missing edges on graphs that are constructed according to spatial correlation. With these approaches, nodes are allowed to attend over their spatial and semantic neighborhoods\u0027 features based on the visual or semantic feature correlation. Experimental results on the well-known VG and VRD datasets demonstrate that our model significantly outperforms the state-of-the-art methods.

查看译文

关键词

structural triplet,subject-predicate-object,object-level graph,triplet-level dependencies,triplet-level graph models,relation triplets,attention mechanism,visual feature correlation,semantic feature correlation,visual relationship detection,hierarchical graph attention network,HGAT,spatial correlation

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要