Reliability Of Using Single Specialist Annotation For Designing And Evaluating Automatic Segmentation Methods: A Skull Stripping Case Study

2018 IEEE 15TH INTERNATIONAL SYMPOSIUM ON BIOMEDICAL IMAGING (ISBI 2018)(2018)

引用 24|浏览21
暂无评分
摘要
Manual annotation by specialists has been largely used as a gold standard for evaluation of automatic segmentation methods in medical imaging and also for designing data-driven approaches. The use of single-specialist manual annotation in this context may lead to a poor assessment and/or training of automatic segmentation methods. This work investigates the limitations of using single-specialist manual annotation as a gold standard. This investigation is framed within a common case study: application of skull stripping to magnetic resonance (MR) images. Twelve MR volumes acquired with scanners of different vendors and at different magnetic field strengths were manually annotated. Each volume was annotated twice. Nine publicly available automatic skull stripping methods were applied to these twelve volumes. The Dice coefficient overlap metric was used to rank the automatic skull stripping methods. Our results show differences (p < 0.05) that depended on the manual annotation reference used to compute the metrics. The ranking of the best skull stripping methods changed with the gold standard. Manual annotation also plays a fundamental role in data-driven approaches, such as convolutional neural networks (CNNs). An auto-context CNN was evaluated using two public datasets in addition to our twelve volumes, to assess manual annotation influence both during training and validation of the algorithm. Our results indicate that the CNN has a strong bias towards the manual annotation used during training, potentially resulting in poor generalization of the model. These results highlight the importance of having robust gold standards, and raise the need for developing better methodologies for generating gold standards.
更多
查看译文
关键词
manual annotation, gold standard, automatic brain structure segmentation, skull stripping
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要