Mitigating the Curse of Dimensionality in Data Anonymization.
MDAI(2019)
摘要
In general, just suppressing identifiers from released microdata is insufficient for privacy protection. It has been shown that the risk of re-identification increases with the dimensionality of the released records. Hence, sound anonymization procedures are needed to anonymize high-dimensional records. Unfortunately, most privacy models yield very poor utility if enforced on data sets with many attributes. In this paper, we propose a method based on principal component analysis (PCA) to mitigate the curse of dimensionality in anonymization. Our aim is to reduce dimensionality without incurring large utility losses. We instantiate our approach with anonymization based on differential privacy. Empirical work shows that using differential privacy on the PCA-transformed and dimensionality-reduced data set yields less information loss than directly using differential privacy on the original data set.
更多查看译文
关键词
Privacy preserving data publishing, Curse of dimensionality, Differential privacy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络