Exploiting the 3D Structures Observed in 2D Video Sequences for Motion Compensation

2021 Picture Coding Symposium (PCS)(2021)

引用 0|浏览12
暂无评分
摘要
Video coding standards widely use 2D translational and affine motion models to compensate for the motion between frames. Noting that most 2D videos are initially captured in a 3D space may lead us to a new direction: using 3D scene geometry for 2D motion compensation. We introduce a method for rendering virtual Reference Pictures (RPs) based on the 3D information extracted from 2D video sequences. The synthesized RPs are offered to the Versatile Video Coding (VVC) encoder to serve as motion compensation references. The 3D information (camera poses and 3D scene geometry) is estimated at the encoder and transmitted to the decoder as an overhead bitstream. Multiple techniques are discussed to minimize this overhead. Simulation results show up to 5% coding gain compared to our anchor (VTM 10.0), with an acceptable increase in decoding time.
更多
查看译文
关键词
VVC,Point Cloud,Mesh Compression,Mesh Decimation,Google Draco,3D Geometry,Video Coding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要