Loss Max-Pooling for Semantic Image Segmentation

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2017)

引用 146|浏览111
暂无评分
摘要
We introduce a novel loss max-pooling concept for handling imbalanced training data distributions, applicable as alternative loss layer in the context of deep neural networks for semantic image segmentation. Most real-world semantic segmentation datasets exhibit long tail distributions with few object categories comprising the majority of data and consequently biasing the classifiers towards them. Our method adaptively re-weights the contributions of each pixel based on their observed losses, targeting under-performing classification results as often encountered for under-represented object classes. Our approach goes beyond conventional cost-sensitive learning attempts through adaptive considerations that allow us to indirectly address both, inter- and intra-class imbalances. We provide a theoretical justification of our approach, complementary to experimental analyses on benchmark datasets. In our experiments on the Cityscapes and Pascal VOC 2012 segmentation datasets we find consistently improved results, demonstrating the efficacy of our approach.
更多
查看译文
关键词
semantic image segmentation,real-world semantic segmentation datasets,object categories,under-represented object classes,imbalanced training data distributions,deep neural networks,loss max-pooling concept,interclass imbalance,intraclass imbalance,tyscapes dataset,Pascal VOC 2012 segmentation dataset
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要