Privacy preserving medical data publishing

Privacy preserving medical data publishing(2012)

引用 24|浏览12
暂无评分
摘要
There is an increasing need for sharing of medical information for public health research. Data custodians and honest brokers have an ethical and legal requirement to protect the privacy of individuals when publishing medical datasets. This dissertation presents an end-to-end Health Information DE-identification (HIDE) system and framework that promotes and enables privacy preserving medical data publishing of textual, structured, and aggregated statistics gleaned from electronic health records (EHRs). This work reviews existing de-identification systems, personal health information (PHI) detection, record anonymization, and differential privacy of multi-dimensional data. HIDE integrates several state-of-the-art algorithms into a unified system for privacy preserving medical data publishing. The system has been applied to a variety of real-world and academic medical datasets. The main contributions of HIDE include: 1) a conceptual framework and software system for anonymizing heterogeneous health data, 2) an adaptation and evaluation of information extraction techniques and modification of sampling techniques for protected health information (PHI) and sensitive information extraction in health data, and 3) applications and extension of privacy techniques to provide privacy preserving publishing options to medical data custodians, including de-identified record release with weak privacy and multidimensional statistical data release with strong privacy.
更多
查看译文
关键词
multidimensional statistical data release,medical data publishing,privacy technique,differential privacy,heterogeneous health data,multi-dimensional data,health data,medical data custodian,medical data,data custodian
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要