EBStereo: edge-based loss function for real-time stereo matching

VISUAL COMPUTER (2023)

Abstract
Deep learning-based stereo matching has made significant progress, but a challenge remains: the disparity prediction error maps of current models show that errors are concentrated primarily on object boundaries. We find that applying the smooth L1 loss function uniformly over the entire image during training cannot effectively address the imbalance between edge regions and flat regions, which results in poor disparity estimates in edge regions. This paper proposes a new weighted smooth L1 loss function that explicitly accounts for the loss on edge regions and yields improved accuracy. An improved bilateral grid upsampling module is also added to the model, and a strategy is adopted to offset the computational cost introduced by the weighted term of the new loss function, so that inference remains real-time. Extensive experiments on two datasets, Scene Flow and KITTI, verify the simplicity and effectiveness of this approach. Running at 33 frames per second (FPS), the proposed model achieves an endpoint error of 0.63. In addition, the proposed edge-based loss function can be easily embedded into many existing stereo matching networks, such as GwcNet, AANet, and PSMNet. With the proposed edge-based loss function embedded, the endpoint errors of these models are reduced by 3.5%, 11.6%, and 27.2% for GwcNet, AANet, and PSMNet, respectively.
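The idea of weighting the smooth L1 loss more heavily on edge regions can be illustrated with a minimal NumPy sketch. The abstract does not give the exact formulation, so everything here is an assumption made for illustration: edges are taken to be pixels where the gradient magnitude of the ground-truth disparity exceeds a threshold, and the names `edge_mask`, `edge_weighted_smooth_l1`, `edge_weight`, and `threshold` are hypothetical rather than taken from the paper.

```python
import numpy as np


def smooth_l1(diff, beta=1.0):
    """Elementwise smooth L1 penalty: quadratic below `beta`, linear above."""
    abs_diff = np.abs(diff)
    return np.where(abs_diff < beta, 0.5 * abs_diff**2 / beta, abs_diff - 0.5 * beta)


def edge_mask(disparity_gt, threshold=1.0):
    """Assumed edge definition: large gradient magnitude in the ground-truth disparity."""
    grad_rows, grad_cols = np.gradient(disparity_gt)
    return np.hypot(grad_rows, grad_cols) > threshold


def edge_weighted_smooth_l1(disparity_pred, disparity_gt, edge_weight=2.0, threshold=1.0):
    """Weighted mean of the smooth L1 penalty; edge pixels count `edge_weight` times."""
    weights = np.where(edge_mask(disparity_gt, threshold), edge_weight, 1.0)
    per_pixel = smooth_l1(disparity_pred - disparity_gt)
    return float(np.sum(weights * per_pixel) / np.sum(weights))


# Toy ground truth: a vertical disparity step between columns 2 and 3.
disparity_gt = np.zeros((4, 6))
disparity_gt[:, 3:] = 10.0

# The same 2-pixel error placed once on the edge and once in a flat region.
pred_edge_error = disparity_gt.copy()
pred_edge_error[1, 2] += 2.0
pred_flat_error = disparity_gt.copy()
pred_flat_error[1, 0] += 2.0

loss_edge = edge_weighted_smooth_l1(pred_edge_error, disparity_gt)
loss_flat = edge_weighted_smooth_l1(pred_flat_error, disparity_gt)
```

In this sketch an error of the same size costs more when it falls on a disparity boundary than when it falls in a flat region, which is the imbalance the abstract describes. Setting `edge_weight=1.0` recovers the ordinary mean smooth L1 loss, and because the weights depend only on the ground truth, they are needed during training only and add no cost at inference time.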
Keywords
Stereo matching, Bilateral grid learning, Loss function, Deep learning, Feature fusion