A Non-Parametric Model for Accurate and Provably Private Synthetic Data Sets

ARES(2017)

引用 2|浏览9
暂无评分
摘要
Generating synthetic data is a well-known option to limit disclosure risk in sensitive data releases. The usual approach is to build a model for the population and then generate a synthetic data set solely based on the model. We argue that building an accurate population model is difficult and we propose instead to approximate the original data as closely as privacy constraints permit. To enforce an ex ante privacy level when generating synthetic data, we introduce a new privacy model called ε synthetic privacy. Then, we describe a synthetic data generation method that satisfies ε-synthetic privacy. Finally, we evaluate the utility of the synthetic data generated with our method.
更多
查看译文
关键词
Synthetic data, non-parametric methods, formal privacy, epsilon-synthetic privacy
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要