Sound Explanation for Trustworthy Machine Learning

CoRR(2023)

引用 0|浏览42
暂无评分
摘要
We take a formal approach to the explainability problem of machine learning systems. We argue against the practice of interpreting black-box models via attributing scores to input components due to inherently conflicting goals of attribution-based interpretation. We prove that no attribution algorithm satisfies specificity, additivity, completeness, and baseline invariance. We then formalize the concept, sound explanation, that has been informally adopted in prior work. A sound explanation entails providing sufficient information to causally explain the predictions made by a system. Finally, we present the application of feature selection as a sound explanation for cancer prediction models to cultivate trust among clinicians.
更多
查看译文
关键词
machine learning,sound,explanation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络