Privacy-preserving query log mining for business confidentiality protection

TWEB(2010)

引用 21|浏览39
暂无评分
摘要
We introduce the concern of confidentiality protection of business information for the publication of search engine query logs and derived data. We study business confidentiality, as the protection of nonpublic data from institutions, such as companies and people in the public eye. In particular, we relate this concern to the involuntary exposure of confidential Web site information, and we transfer this problem into the field of privacy-preserving data mining. We characterize the possible adversaries interested in disclosing Web site confidential data and the attack strategies that they could use. These attacks are based on different vulnerabilities found in query log for which we present several anonymization heuristics to prevent them. We perform an experimental evaluation to estimate the remaining utility of the log after the application of our anonymization techniques. Our experimental results show that a query log can be anonymized against these specific attacks while retaining a significant volume of useful data.
更多
查看译文
关键词
web sites,business confidentiality,nonpublic data,business confidentiality protection,query log mining,query log publication,confidential data,privacy preservation,search engine query log,anonymization heuristics,useful data,privacy-preserving data mining,query log,queries,web site,anonymization technique,search engine
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要