Unsupervised multi-perspective fusing semantic alignment for cross-modal hashing retrieval

Multimedia Tools and Applications(2024)

Abstract
Owing to their low computational cost, excellent storage capacity, and efficient retrieval performance, unsupervised deep cross-modal hashing methods have received extensive attention. However, existing unsupervised methods still face two challenges: (1) Because label semantics are unavailable, the neighborhood structure information of intra-modal and inter-modal instances may not be fully integrated, so deep semantic similarity interaction information is ignored. (2) Unsupervised hash codes can neither effectively preserve the semantic consistency between the original features of modal instances nor bridge the gap between the heterogeneous modalities of the hash codes. To address these issues, we propose a new unsupervised deep cross-modal hashing method called Multi-Perspective Fusing Semantic Alignment Hashing (MPFSAH). It has two main components. First, to enhance inter-modal communication, we construct a Multi-level Semantic Similarity Interactive Measure (MSSIM). By fusing the neighborhood structures of different modalities and increasing the distance between instances within a modality, MSSIM mines the semantic interaction similarity in depth and obtains discriminative semantic information. Second, we propose a novel Multi-Perspective Semantic Alignment Mechanism (MPSAM). By minimizing the consistency quantization error of the elements in the multi-perspective similarity, it learns inter-modal similarity consistency. MPSAM comprises similarity consistency alignment, structural-semantic alignment, and ranking alignment. By achieving structural-semantic consistency, it ensures that cross-modal data similarities are effectively connected and bridges the modal gap during hash code learning. Experiments on three cross-modal retrieval datasets demonstrate the effectiveness of the proposed method, which outperforms several state-of-the-art methods.
Keywords
Unsupervised hashing, Cross-modal retrieval, Similarity preservation, Semantic alignment
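
The abstract describes fusing the neighborhood structures of two modalities into one similarity target and then aligning hash-code similarities with it. The abstract gives no formulas, so the following is only a minimal sketch of that general idea under stated assumptions, not the MSSIM or MPSAM formulation from the paper: cosine similarity as the intra-modal measure, a convex combination with a hypothetical weight `alpha` as the fusion, and a mean squared error as the consistency term. All function names are illustrative.

```python
import numpy as np


def cosine_similarity_matrix(features):
    """Pairwise cosine similarity between the rows of `features`."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    unit = features / np.clip(norms, 1e-12, None)  # guard against zero rows
    return unit @ unit.T


def fused_similarity(image_features, text_features, alpha=0.5):
    """Convex combination of the two intra-modal similarity matrices.

    `alpha` is a hypothetical weighting hyperparameter (an assumption,
    not a value taken from the paper).
    """
    s_img = cosine_similarity_matrix(image_features)
    s_txt = cosine_similarity_matrix(text_features)
    return alpha * s_img + (1.0 - alpha) * s_txt


def similarity_consistency_loss(image_codes, text_codes, target):
    """Mean squared gap between cross-modal code similarity and the target."""
    bits = image_codes.shape[1]
    # For +/-1 codes the scaled inner product lies in [-1, 1].
    code_sim = image_codes @ text_codes.T / bits
    return float(np.mean((code_sim - target) ** 2))


# Toy data: 4 paired instances, 8-d image features, 6-d text features.
rng = np.random.default_rng(0)
image_features = rng.normal(size=(4, 8))
text_features = rng.normal(size=(4, 6))
S = fused_similarity(image_features, text_features)

# Random 16-bit codes stand in for the outputs of the two hashing networks.
image_codes = np.sign(rng.normal(size=(4, 16)))
text_codes = np.sign(rng.normal(size=(4, 16)))
loss = similarity_consistency_loss(image_codes, text_codes, S)
```

In an actual training loop the codes would come from learnable networks and the loss would be minimized by gradient descent; here random codes are used only to show the shapes involved.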