Practical Deep Learning with Bayesian Principles.

Advances in Neural Information Processing Systems 32 (NeurIPS 2019)

Abstract
Bayesian methods promise to fix many shortcomings of deep learning, but they are impractical and rarely match the performance of standard methods, let alone improve them. In this paper, we demonstrate practical training of deep networks with natural-gradient variational inference. By applying techniques such as batch normalisation, data augmentation, and distributed training, we achieve similar performance in about the same number of epochs as the Adam optimiser, even on large datasets such as ImageNet. Importantly, the benefits of Bayesian principles are preserved: predictive probabilities are well-calibrated, uncertainties on out-of-distribution data are improved, and continual-learning performance is boosted. This work enables practical deep learning while preserving benefits of Bayesian principles. A PyTorch implementation is available as a plug-and-play optimiser.
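
The plug-and-play optimiser referred to above performs natural-gradient variational inference over the network weights. The sketch below is an illustrative simplification, not the authors' released code: a single VOGN-style update for a diagonal-Gaussian posterior q(w) = N(mu, 1/prec), where the precision is tracked by a moving average of a crude squared-gradient curvature proxy and the mean takes a preconditioned (natural-gradient) step. All function and variable names here are assumptions for illustration.

import torch

def ngvi_step(mu, prec, grad_fn, n_data, lr=0.05, beta=0.99, prior_prec=1.0):
    """One simplified VOGN-style natural-gradient VI update for q(w) = N(mu, 1/prec)."""
    w = mu + prec.rsqrt() * torch.randn_like(mu)   # Monte Carlo sample from q
    g = grad_fn(w)                                  # gradient of the minibatch-average loss at w
    # Precision: moving average of a curvature proxy plus the Gaussian prior precision.
    # (The paper uses per-example Gauss-Newton terms; the squared gradient stands in here.)
    prec = (1 - beta) * prec + beta * (n_data * g * g + prior_prec)
    # Mean: gradient of the expected scaled loss plus the prior term, preconditioned by
    # the new precision, which gives the natural-gradient direction for a Gaussian mean.
    mu = mu - lr * (n_data * g + prior_prec * mu) / prec
    return mu, prec

# Toy usage on a quadratic loss, with a hypothetical dataset size of 100.
n_data, target = 100, torch.tensor([1.0, -2.0])
mu, prec = torch.zeros(2), torch.ones(2)
def quad_grad(w):                                   # d/dw of 0.5 * ||w - target||^2
    return w - target
for _ in range(500):
    mu, prec = ngvi_step(mu, prec, quad_grad, n_data)
print(mu, 1.0 / prec)                               # mu ends up roughly near target (the update is stochastic); 1/prec is the posterior variance

At prediction time, as is standard for variational methods, outputs are averaged over Monte Carlo samples of the weights from q, which is where the calibrated predictive probabilities and improved out-of-distribution uncertainty mentioned in the abstract come from.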
Keywords
deep learning, batch normalisation, data augmentation, Bayesian methods