Revisiting the Calibration of Modern Neural Networks.

Matthias Minderer,Josip Djolonga,Rob Romijnders,Frances Hubis,Xiaohua Zhai,Neil Houlsby,Dustin Tran,Mario Lucic

Annual Conference on Neural Information Processing Systems（2021）

引用 204|浏览97

暂无评分

摘要

Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks. Many instances of miscalibration in modern neural networks have been reported, suggesting a trend that newer, more accurate models produce poorly calibrated predictions. Here, we revisit this question for recent state-of-the-art image classification models. We systematically relate model calibration and accuracy, and find that the most recent models, notably those not using convolutions, are among the best calibrated. Trends observed in prior model generations, such as decay of calibration with distribution shift or model size, are less pronounced in recent architectures. We also show that model size and amount of pretraining do not fully explain these differences, suggesting that architecture is a major determinant of calibration properties.

查看译文

关键词

calibration,neural networks

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要