Visual pertinent 2D-to-3D video conversion by multi-cue fusion

ICIP (2011)

Abstract
We describe an approach to 2D-to-3D video conversion for stereoscopic display. To synthesize the frames of a virtual 'right view' from the original monocular 2D video, we generate the stereoscopic video in the following steps. (1) A 2.5D depth map is first estimated by multi-cue fusion, leveraging motion cues and photometric cues in the video frames together with a depth prior of spatial and temporal smoothness. (2) The depth map is converted to a disparity map, taking into account both the size of the display device and the constraints of human stereoscopic visual perception. (3) We fix the original 2D frames as the 'left view' and warp them to the "virtually viewed" right view according to the predicted disparity values. The main contribution of this method is the combination of motion and photometric cues to estimate the depth map. In the experiments, we apply our method to convert several clips from well-known films into stereoscopic 3D video and obtain good results.
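Steps (2) and (3) of the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the depth-to-disparity mapping, the warping scheme, or the hole-filling strategy, so the linear mapping, the nearest-pixel forward warp, and the row-wise hole filling below are all assumptions. The function names `depth_to_disparity` and `warp_to_right_view` and the parameter `max_disparity_px` (standing in for a limit derived from display size and viewing comfort) are hypothetical.

```python
import numpy as np


def depth_to_disparity(depth, max_disparity_px):
    """Map normalized depth (0 = nearest, 1 = farthest) to pixel disparity.

    Assumption: a linear mapping in which the farthest depth gets zero
    disparity and the nearest depth gets max_disparity_px.
    """
    depth = np.clip(np.asarray(depth, dtype=np.float64), 0.0, 1.0)
    return (1.0 - depth) * max_disparity_px


def warp_to_right_view(left, disparity):
    """Forward-warp the left view into a virtual right view.

    Each left pixel at column x is moved to column x - round(disparity).
    When several pixels land on the same target, the one with the larger
    disparity (the nearer one) wins. Returns the warped image and a boolean
    mask of the holes that had to be filled.
    """
    h, w = disparity.shape
    right = np.zeros_like(left)
    filled = np.zeros((h, w), dtype=bool)
    best = np.full((h, w), -np.inf)  # largest disparity written so far
    shift = np.rint(disparity).astype(int)

    for y in range(h):
        for x in range(w):
            xr = x - shift[y, x]
            if 0 <= xr < w and disparity[y, x] > best[y, xr]:
                right[y, xr] = left[y, x]
                best[y, xr] = disparity[y, x]
                filled[y, xr] = True

    holes = ~filled

    # Assumed hole filling: disocclusions open on the right side of near
    # objects, so copy from the right neighbour (background) first, then
    # fill anything still empty from the left neighbour.
    for y in range(h):
        for x in range(w - 2, -1, -1):
            if not filled[y, x] and filled[y, x + 1]:
                right[y, x] = right[y, x + 1]
                filled[y, x] = True
        for x in range(1, w):
            if not filled[y, x] and filled[y, x - 1]:
                right[y, x] = right[y, x - 1]
                filled[y, x] = True

    return right, holes


# Usage on a single-row toy frame: one near pixel in front of a far background.
left = np.array([[10, 20, 30, 40]])
depth = np.array([[1.0, 1.0, 0.0, 1.0]])
disparity = depth_to_disparity(depth, max_disparity_px=1.0)
right, holes = warp_to_right_view(left, disparity)
```

In this toy frame the near pixel (value 30) moves one column to the left and covers the background pixel there, and the column it vacates is filled from the background on its right. A practical system would likely use sub-pixel warping and a stronger inpainting step.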
Keywords
video signal processing,original monocular 2d video frame,image fusion,photometric cues,virtual view synthesis,motion cues,2d-to-3d,stereoscopic,stereoscopic display,disparity,multicue fusion,visual pertinent 2d-to-3d video conversion,human stereoscopic visual perception constraint,disparity map,depth,spatial smoothness,stereo image processing,temporal smoothness,stereoscopic 3d video frame,depth map estimation,image processing,motion pictures,three dimensional,visual perception,depth map,visualization,estimation,2d to 3d