Gaussian Mixtures with Missing Data: an Efficient EM Training Algorithm

Olivier Delalleau,Aaron Courville,Yoshua Bengio

mag（2013）

引用 23|浏览76

暂无评分

摘要

In data-mining applications, we are frequently faced with a large fraction of missing entries in the data matrix, which is problematic for most discriminant machine learning algorithms. A solution that we explore here is the use of a generative model (a mixture of Gaussians with full covariances) to learn the underlying data distribution and replace missing values by their conditional expectation given the observed variables. Since training a Gaussian mixture with many dieren t patterns of missing values can be computationally very expensive, we introduce a spanning-tree based algorithm that signican tly speeds up training in these conditions. Such mixtures of Gaussians can be applied directly to supervised problems (Ghahramani and Jordan, 1994), but we observe that using them for missing value imputation before applying a separate discriminant learning algorithm yields better results. Our contributions are two-fold: 1. We explain why the basic EM training algorithm is not practical in large-dimensional ap- plications in the presence of missing values, and we propose a novel training algorithm that signican tly speeds up training by EM. The algorithm we propose relies on the idea to re-use the computations performed on one training sample as a basis for the next sample, in order to obtain the quantities required by the EM update equations. We show how these computa- tions can be minimized by ordering samples in such a way that two consecutive samples have similar \missing patterns", i.e. share missing values for similar variables. On 28x28 images with random squares of 5x5 pixels being forced to missing values, we obtain a speed-up on the order of 8 compared to standard EM training. 2. We show, both by visual inspection on image data (gure 1) and by feeding the imputed values to another classication algorithm (gure 2), how a mixture of Gaussians can model the data distribution so as to provide a valuable tool for missing values imputation.

查看译文

关键词

conditional expectation,missing values,discrimination learning,mixture of gaussians,visual inspection,spanning tree,machine learning,missing data

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要