Utility-Privacy Tradeoffs in Databases: An Information-Theoretic Approach

IEEE Transactions on Information Forensics and Security(2013)

引用 397|浏览53
暂无评分
摘要
Ensuring the usefulness of electronic data sources while providing necessary privacy guarantees is an important unsolved problem. This problem drives the need for an analytical framework that can quantify the privacy of personally identifiable information while still providing a quantifiable benefit (utility) to multiple legitimate information consumers. This paper presents an information-theoretic framework that promises an analytical model guaranteeing tight bounds of how much utility is possible for a given level of privacy and vice-versa. Specific contributions include: 1) stochastic data models for both categorical and numerical data; 2) utility-privacy tradeoff regions and the encoding (sanization) schemes achieving them for both classes and their practical relevance; and 3) modeling of prior knowledge at the user and/or data source and optimal encoding schemes for both cases.
更多
查看译文
关键词
data analysis,data privacy,database management systems,information theory,analytical model,database,electronic data source,encoding scheme,information consumer,information theoretic approach,personally identifiable information privacy,privacy guarantee,quantifiable benefit,stochastic data model,utility-privacy tradeoff,Utility,databases,equivocation,privacy,rate-distortion theory,side information
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要