A Fine-grained Privacy-preserving k-means Clustering Algorithm Upon Negative Databases

2019 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI 2019)(2019)

引用 2|浏览50
暂无评分
摘要
Nowadays, privacy protection has become an important issue in data mining. k-means algorithm is one of the most classical data mining algorithms, and it has been widely studied in the past decade. Negative database (NDB) is a new type of data representation which can protect privacy while supporting distance estimation, so it is promising to apply NDBs to privacy-preserving k-means clustering. Existing privacy-preserving k-means clustering algorithms based on NDBs could effectively protect data privacy, but their clustering performance has a non-negligible degradation. In this paper, we propose a new NDB generation algorithm (named QK-hidden algorithm), and based on this algorithm, we propose a privacy-preserving k-means algorithm. The proposed algorithm can control the accuracy of distance estimation in a fine-grained manner, and thus it can control the clustering results granularly. Experimental results demonstrate the proposed algorithm has better clustering performance than existing privacy-preserving k-means algorithms based on NDBs.
更多
查看译文
关键词
privacy protection, negative database, k-means clustering, Euclidean distance
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要