Weakly supervised instance segmentation via peak mining and filtering

Zuxian Huang, Dongsheng Pan, Gangshan Wu

IET IMAGE PROCESSING (2024)

Abstract
Learning the full extent of pixel-level instance response in a weakly supervised manner remains unsatisfactory. Peak response maps (PRMs) localize the discriminative object regions but cannot provide complete instance information, so they suffer from incomplete segmentation and from unreliable mask prediction caused by noisy proposal retrieval. This work tackles this challenging problem by mining diverse class peak responses that include more discriminative and complete object regions, and by retrieving more reliable proposals from noisy segment proposal galleries. First, the existing method is enhanced with two more classification branches, which contribute more diverse and abundant instance regions from peak response maps. The class peak responses mined from two of the branches are then merged by a clustering approach in their deep feature space to generate more complete peak response maps. Then, instance segmentation masks are retrieved from a noisy object segment proposal gallery using class confidence, which is calculated by a normal classifier, to obtain cleaner mask predictions. Finally, the resulting pseudo-supervision can be used to train an instance segmentation network in a fully supervised manner. Experiments on the PASCAL VOC 2012 dataset and the COCO dataset show that the approach works effectively and outperforms its counterparts by margins of more than 6%, 4%, and 3% in mean average precision (mAP) at IoU thresholds of 0.25, 0.5, and 0.75, respectively.

An instance mining approach is proposed to tackle the challenge of incomplete region localization for image-level weakly supervised instance segmentation tasks by discovering more unseen instances and more complete object regions. The challenge of noisy mask prediction is tackled by integrating class confidence to obtain more reliable and cleaner instance masks through an instance filtering approach. On the PASCAL VOC 2012 dataset and the COCO dataset, an implementation of the model with popular DCNNs, e.g. ResNet50, substantially improves the performance of this task.
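To make the instance filtering idea more concrete, the following is a minimal sketch of retrieving one mask from a proposal gallery using class confidence. It is not the authors' implementation: the scoring rule (class confidence multiplied by the fraction of peak response mass covered by the mask), the `min_conf` threshold, and the function names `response_overlap` and `retrieve_proposal` are all assumptions made for illustration, and masks are represented as plain nested lists.

```python
from typing import List, Optional, Sequence, Tuple

Mask = List[List[int]]        # binary mask, 1 = pixel belongs to the proposal
Response = List[List[float]]  # non-negative peak response map for one instance


def response_overlap(mask: Mask, response: Response) -> float:
    """Fraction of the total response mass that falls inside the mask."""
    total = sum(sum(row) for row in response)
    if total == 0:
        return 0.0
    inside = sum(
        r
        for mask_row, resp_row in zip(mask, response)
        for m, r in zip(mask_row, resp_row)
        if m
    )
    return inside / total


def retrieve_proposal(
    proposals: Sequence[Mask],
    confidences: Sequence[float],
    response: Response,
    min_conf: float = 0.5,  # hypothetical threshold, not taken from the paper
) -> Tuple[Optional[int], float]:
    """Return (index, score) of the best proposal, or (None, 0.0) if none pass.

    Proposals whose class confidence is below min_conf are discarded first;
    the rest are ranked by confidence * response overlap (an assumed rule).
    """
    best_idx: Optional[int] = None
    best_score = 0.0
    for i, (mask, conf) in enumerate(zip(proposals, confidences)):
        if conf < min_conf:
            continue  # filter out low-confidence (likely noisy) proposals
        score = conf * response_overlap(mask, response)
        if score > best_score:
            best_idx, best_score = i, score
    return best_idx, best_score


if __name__ == "__main__":
    # Toy 4x4 response concentrated on the central 2x2 block.
    response = [[0, 0, 0, 0], [0, 2, 2, 0], [0, 2, 2, 0], [0, 0, 0, 0]]
    whole = [[1] * 4 for _ in range(4)]                              # covers everything
    center = [[0, 0, 0, 0], [0, 1, 1, 0], [0, 1, 1, 0], [0, 0, 0, 0]]
    left = [[1, 1, 0, 0] for _ in range(4)]                          # left half only
    print(retrieve_proposal([whole, center, left], [0.3, 0.9, 0.95], response))
```

In this toy example the whole-image proposal is dropped by the confidence filter even though it covers all of the response, which is the kind of noisy retrieval that a confidence term is meant to suppress. A practical scoring rule would likely also penalize masks that extend far beyond the response region.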
Keywords
computer vision,image segmentation