A Mid-Level Representation Of Visual Structures For Video Compression
2016 IEEE Winter Conference on Applications of Computer Vision (WACV)(2016)
摘要
A video coding system is presented that partitions the scene into "visual structures" and a residual "background" layer. A low-level representation ("track-template") of visual structures is proposed that exploits their temporal redundancy. A dictionary of track-templates is constructed that is used to encode video frames. We make optimal use of the dictionary in terms of rate-distortion by choosing a subset of the dictionary's elements for encoding using a Markov Random Field (MRF) formulation that places the track-templates in "depth" layers. The selected "track-templates" form the mid-level representation of the "visual structure" regions of the video. Our video coding system offers improvements over H.265/H.264 and other methods in a rate-distortion comparison.
更多查看译文
关键词
midlevel representation,visual structure,video compression,video coding system,residual background layer,video frame encoding,Markov random field,MRF
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络