How Many Bits Does It Take For A Stimulus To Be Salient?

2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015

Citations: 114 | Views: 29
Abstract
Visual saliency has been shown to depend on the unpredictability of the visual stimulus given its surround. Various previous works have advocated the equivalence between stimulus saliency and uncompressibility. We propose a direct measure of this quantity, namely the number of bits required by an optimal video compressor to encode a given video patch, and show that features derived from this measure are highly predictive of eye fixations. To account for global saliency effects, these features are embedded in a Markov random field model. The resulting saliency measure is shown to achieve state-of-the-art accuracy for the prediction of fixations, at a very low computational cost. Since most modern cameras incorporate video encoders, this paves the way for in-camera saliency estimation, which could be useful in a variety of computer vision applications.
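The core idea, counting the bits a compressor spends on each patch, can be illustrated with a minimal sketch. It rests on loud assumptions: zlib stands in for the video encoder (it is a general-purpose byte compressor, not the optimal video compressor the abstract refers to), the frame is a toy 8-bit grayscale grid, there is no temporal prediction, and the Markov random field stage is omitted entirely. The helper names `patch_bits` and `bit_map` are hypothetical and do not come from the paper.

```python
import random
import zlib


def patch_bits(frame, top, left, size):
    """Bits zlib needs to encode one size x size patch of an 8-bit frame.

    Assumption: zlib is only a stand-in for a real video encoder.
    """
    rows = (frame[r][left:left + size] for r in range(top, top + size))
    raw = bytes(v for row in rows for v in row)
    return 8 * len(zlib.compress(raw, 9))


def bit_map(frame, size=16):
    """Per-patch bit counts over a grid of non-overlapping patches."""
    height, width = len(frame), len(frame[0])
    return [
        [patch_bits(frame, top, left, size)
         for left in range(0, width - size + 1, size)]
        for top in range(0, height - size + 1, size)
    ]


# Toy frame: flat gray background with one noisy, hard-to-compress patch.
rng = random.Random(0)
frame = [[128] * 64 for _ in range(64)]
for r in range(16, 32):
    for c in range(32, 48):
        frame[r][c] = rng.randrange(256)

bits = bit_map(frame)  # 4 x 4 grid of bit counts
```

In this toy frame the noisy patch at grid position (1, 2) costs far more bits than the flat patches around it, which is the sense in which an incompressible stimulus stands out from its surround.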
Keywords
visual saliency, visual stimulus, stimulus saliency, stimulus uncompressibility, video compressor, video patch, Markov random field model, saliency measure, video encoders, in-camera saliency estimation, computer vision applications