Estimating the Risk of Individual Discrimination of Classifiers.

PAKDD (1)(2023)

引用 0|浏览14
暂无评分
摘要
Data owners are increasingly liable for the potential harm caused by using their data on underprivileged communities. Stakeholders seek to identify data characteristics that lead to biased algorithms against specific demographic groups, such as race, gender, age, or religion. We focus on identifying feature subsets of datasets where the ground truth response function from features to observed outcomes differs across demographic groups. To achieve this, we propose FORESEE, a decision tree-based algorithm that generates a score indicating the likelihood of an individual’s response varying with sensitive attributes. Our approach enables us to identify individuals most likely to be misclassified by various classifiers, including Random Forest, Logistic Regression, Support Vector Machine, Multi-Layer Perceptron, and k-Nearest Neighbors. The advantage of our approach is that it allows stakeholders to identify risky samples that may contribute to discrimination and use FORESEE to estimate the risk of upcoming samples.
更多
查看译文
关键词
individual discrimination,classifiers
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要