Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection

IEEE Signal Processing Letters(2022)

引用 3|浏览23
暂无评分
摘要
The multi-modal salient object detection model based on RGB-D information has better robustness in the real world. However, it remains nontrivial to better adaptively balance multi-modal information in the feature fusion phase. In this letter, we propose a novel gated recoding network (GRNet) to evaluate the information validity of the two modes and balance their influence. Our framework is divided into three phases: perception phase, recoding mixing phase, and integration phase. Specifically, a perception encoder is adopted to extract multi-level single-modal features, which lays the foundation for multi-modal semantic comparative analysis. Then, a modal-adaptive gate unit (MGU) is proposed to suppress the invalid information and transfer the effective modal features to the recoding mixer and the hybrid branch decoder. The recoding mixer is responsible for recoding and mixing the balanced multi-modal information. Finally, the hybrid branch decoder completes the multi-level feature integration under the guidance of an optional edge guidance stream (OEGS). Experiments on 8 popular benchmarks verify that our framework has better overall performance than the other 28 state-of-the-art algorithms.
更多
查看译文
关键词
Salient object detection,multi-modal,gated mechanism,edge guidance,feature fusion
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要