Propensity score based conditional group swapping for disclosure limitation of strata-defining variables.

PRIVACY IN STATISTICAL DATABASES: UNESCO CHAIR IN DATA PRIVACY (2016)

Abstract
In this paper we propose a method for statistical disclosure limitation of categorical variables, which we call Conditional Group Swapping. The approach is suited to design and strata-defining variables, whose cross-classification forms important groups or subpopulations. These groups matter because, from the point of view of data analysis, it is desirable to preserve the analytical characteristics within them. In general, data swapping can be quite distorting ([12, 18, 15]), particularly for the relationships between variables, both within the subpopulations and in the overall data. To reduce the damage incurred by swapping, we propose choosing the records to swap using conditional probabilities that depend on the characteristics of the exchanged records. In particular, our approach uses results from propensity score methodology to compute the swapping probabilities. The experimental results presented in the paper show good utility properties of the method.
Keywords
Statistical disclosure limitation (SDL), Group swapping, Strata, Subpopulations, Propensity scores
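
The abstract does not spell out how the swapping probabilities are computed, so the following is only a minimal sketch of the general idea under stated assumptions, not the authors' algorithm. It assumes a binary strata-defining variable `g`, a single numeric covariate `x`, a propensity score estimated by a hand-rolled logistic regression, and a selection rule in which a record is more likely to be moved when its estimated probability of belonging to the other group is high. All function names and parameters are hypothetical.

```python
import math
import random


def fit_logistic(xs, gs, lr=0.1, epochs=500):
    """Fit P(g=1 | x) = sigmoid(a + b*x) by plain gradient ascent (assumed model)."""
    a, b = 0.0, 0.0
    n = len(xs)
    for _ in range(epochs):
        grad_a = grad_b = 0.0
        for x, g in zip(xs, gs):
            p = 1.0 / (1.0 + math.exp(-(a + b * x)))
            grad_a += g - p
            grad_b += (g - p) * x
        a += lr * grad_a / n
        b += lr * grad_b / n
    return a, b


def propensity(a, b, x):
    """Estimated probability that a record with covariate x belongs to group 1."""
    return 1.0 / (1.0 + math.exp(-(a + b * x)))


def weighted_sample_without_replacement(indices, weights, k, rng):
    """Draw up to k distinct indices, each draw proportional to remaining weights."""
    pool = list(zip(indices, weights))
    chosen = []
    for _ in range(min(k, len(pool))):
        total = sum(w for _, w in pool)
        r = rng.random() * total
        acc = 0.0
        for pos, (idx, w) in enumerate(pool):
            acc += w
            if r <= acc:
                chosen.append(idx)
                pool.pop(pos)
                break
        else:
            # Guard against floating-point rounding: fall back to the last item.
            chosen.append(pool.pop()[0])
    return chosen


def conditional_group_swap(xs, gs, n_pairs, seed=0):
    """Exchange group labels between n_pairs records from each group.

    Assumption (not from the paper): a group-0 record is selected with weight
    P(g=1 | x), and a group-1 record with weight P(g=0 | x).
    """
    rng = random.Random(seed)
    a, b = fit_logistic(xs, gs)
    scores = [propensity(a, b, x) for x in xs]
    group0 = [i for i, g in enumerate(gs) if g == 0]
    group1 = [i for i, g in enumerate(gs) if g == 1]
    move_to_1 = weighted_sample_without_replacement(
        group0, [scores[i] for i in group0], n_pairs, rng)
    move_to_0 = weighted_sample_without_replacement(
        group1, [1.0 - scores[i] for i in group1], n_pairs, rng)
    k = min(len(move_to_1), len(move_to_0))  # swap in pairs to keep group sizes
    swapped = list(gs)
    for i in move_to_1[:k]:
        swapped[i] = 1
    for j in move_to_0[:k]:
        swapped[j] = 0
    return swapped


# Synthetic illustration only: two overlapping groups of 20 records each.
data_rng = random.Random(1)
xs = [data_rng.gauss(0.0, 1.0) for _ in range(20)] + \
     [data_rng.gauss(1.0, 1.0) for _ in range(20)]
gs = [0] * 20 + [1] * 20
swapped = conditional_group_swap(xs, gs, n_pairs=3)
changed = sum(1 for old, new in zip(gs, swapped) if old != new)
print(changed, sum(swapped))
```

Because labels are exchanged in pairs, the group sizes are unchanged in this sketch. Records whose covariate values resemble the other group are the ones most likely to be relabelled, which is one plausible way to limit the distortion of within-group relationships.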