Multi-Level Model For Video Saliency Detection

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP)(2019)

引用 6|浏览44
暂无评分
摘要
This paper proposes a fast detection model for video salient objects based on recurrent network architecture. Firstly, a multi-level attention (MLA) module is designed, which integrates multi-level feature maps in a cascaded manner. It effectively extracts the semantic information and detailed information of the intra-frame. These spatial features are input into a deeper bidirectional ConvLSTM to learn temporal dependence. Secondly, the result of the forward flow output is used as a backward input, and deeper temporal dependence is extracted. Finally, we present a spatial-temporal fused bidirectional ConvLSTM framework, which reduces the accumulated memory in the bidirectional ConvLSTM by exploiting element level fusion strategy. The experimental results show that the proposed method achieves the best detection precision on the two challenging benchmarks: ViSal and FBMS datasets, with a real-time speed of 23 fps.
更多
查看译文
关键词
Salient objects, Multi-level, ConvLSTM, Spatial-temporal fusion
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要