Manifold Mixup: Learning Better Representations by Interpolating Hidden States
arXiv: Machine Learning, 2018.
Deep networks often perform well on the data distribution on which they are trained, yet give incorrect (and often very confident) answers when evaluated on points from off of the training distribution. This is exemplified by the adversarial examples phenomenon but can also be seen in terms of model generalization and domain shift. Ideall...More