Medical Image De-Identification using Cloud Services

B. Kopchick, J. Klenk, T. Carlson, M. Kumpatla, S. Klimov, D. Mikdadi, Q. Pan, S. Gustafson, J. Kaltman,U. Wagner,D. Clunie,K. Farahani

MEDICAL IMAGING 2022: IMAGING INFORMATICS FOR HEALTHCARE, RESEARCH, AND APPLICATIONS(2022)

引用 1|浏览11
暂无评分
摘要
Purpose: Patient privacy rules require removal of Protected Health Information (PHI) before sharing images publicly. Manual de-identification is no longer scalable due to the rapid increase in imaging data volume. Our goal was to configure and test the efficacy of an automated medical image de-identification (MIDI) pipeline using cloud services. Materials and Methods: Training and test datasets for validation of image de-identification, specifically prepared by placement of synthetic PHI in DICOM headers and image pixel data, were prepared by The Cancer Imaging Archive (TCIA). These datasets included 1,836/14,372 images from 21/93 patients, respectively. Answer keys based on TCIA de-identification conventions were made available for the two datasets. The MIDI pipeline was configured using the Google Cloud Platform Healthcare API, which is based on Google's Data Loss Prevention API for sensitive information detection. Performance was also measured by monitoring throughput. Results: For DICOM header data elements, 99.8% of expected actions were performed correctly. The two incorrect actions included one false-positive case (information removed incorrectly), and one false-negative case (PHI not removed). For the image pixel data, one false-positive was noted. There were no false negatives; all sensitive information was correctly removed from all image pixel data. Throughput averaged at 58.4 images per second. Conclusion: The current implementation of the MIDI pipeline holds great promise for automated de-identification at scale. However, verification by a human expert is currently recommended. Optimization of the underlying algorithm could further increase accuracy.
更多
查看译文
关键词
De-identification, DICOM, Cloud, GCP
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要