MSDC-Net: Multi-Scale Dense and Contextual Networks for Stereo Matching

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference(2019)

引用 6|浏览47
暂无评分
摘要
Disparity prediction from stereo images is essential to computer vision applications such as autonomous driving, 3D model reconstruction, and object detection. To more accurately predict disparity map, a novel deep learning architecture (called MSDC-Net) for detecting the disparity map from a rectified pair of stereo images is proposed. Our MSDC-Net contains two modules: the multi-scale fusion 2D convolution module and the multi-scale residual 3D convolution module. The multi-scale fusion 2D convolution module exploits the potential multi-scale features, which extracts and fuses the different scale features by Dense-Net. The multi-scale residual 3D convolution module learns the different scale geometry context from the cost volume which aggregated by the multi-scale fusion 2D convolution module. Experimental results on Scene Flow and KITTI datasets demonstrate that our MSDC-Net significantly outperforms other approaches in the non-occluded region.
更多
查看译文
关键词
MSDC-Net,multiscale dense networks,stereo image processing,multiscale fusion 2D convolution module,multiscale residual 3D convolution module,scale feature contextual networks,scale geometry contextual networks,stereo matching,computer vision applications,autonomous driving,3D model reconstruction,object detection,deep learning architecture,disparity map detection,KITTI datasets,Scene Flow datasets
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要