Improving Fusion of Dimensionality Reduction Methods for Nearest Neighbor Classification
ICMLA (2009)
Abstract
Previous studies have investigated dimensionality reduction as a way to improve nearest neighbor classification of high-dimensional data, such as microarrays. It has been demonstrated that fusing dimensionality reduction methods, either by fusing the classifiers obtained from each set of reduced features or by fusing all reduced features, can be better than using any single dimensionality reduction method. However, none of the fusion methods consistently outperforms the use of a single dimensionality reduction method. Therefore, a new way of fusing features and classifiers is proposed, which is based on searching for the optimal number of dimensions for each considered dimensionality reduction method. An empirical evaluation on microarray classification is presented, comparing classifier and feature fusion with and without the proposed method, in conjunction with three dimensionality reduction methods: Principal Component Analysis (PCA), Partial Least Squares (PLS) and Information Gain (IG). The new classifier fusion method outperforms the previous one in 4 out of 8 cases, and is on par with the best single dimensionality reduction method. The novel feature fusion method is, however, outperformed by the previous method, which selects the same number of features from each dimensionality reduction method. Hence, it is concluded that the idea of optimizing the number of features separately for each dimensionality reduction method can only be recommended for classifier fusion.
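The abstract gives no implementation details, so the following is only a minimal NumPy sketch of the classifier-fusion idea: the number of dimensions is tuned separately for each reduction method, and the resulting 1-nearest-neighbor classifiers are then combined. Several choices here are assumptions rather than the authors' procedure: the combination rule (majority vote), the selection criterion (5-fold cross-validated accuracy), the simple class-separation filter used as a stand-in for Information Gain, the two-class PLS1 variant, the synthetic data, and all function names.

```python
import numpy as np

def fit_pca(X, y):
    """PCA: project onto the top-k principal directions (y is unused)."""
    mean = X.mean(axis=0)
    _, _, vt = np.linalg.svd(X - mean, full_matrices=False)
    return lambda A, k: (A - mean) @ vt[:k].T

def fit_pls(X, y, max_k=10):
    """PLS1 via NIPALS on numerically coded labels (assumes two classes).
    At most max_k components are computed; larger k values reuse all of them."""
    mean = X.mean(axis=0)
    Xc, yc = X - mean, y - y.mean()
    W, P = [], []
    for _ in range(max_k):
        w = Xc.T @ yc
        if np.linalg.norm(w) < 1e-10:
            break  # nothing left in X that covaries with y
        w = w / np.linalg.norm(w)
        t = Xc @ w
        p = Xc.T @ t / (t @ t)
        Xc = Xc - np.outer(t, p)  # deflate X before extracting the next component
        W.append(w)
        P.append(p)
    W, P = np.array(W).T, np.array(P).T
    R = W @ np.linalg.inv(P.T @ W)  # maps centred data directly to PLS scores
    return lambda A, k: (A - mean) @ R[:, :k]

def fit_filter(X, y):
    """Hypothetical stand-in for Information Gain: rank features by the gap
    between class means relative to the feature's standard deviation."""
    means = np.array([X[y == c].mean(axis=0) for c in np.unique(y)])
    score = (means.max(axis=0) - means.min(axis=0)) / (X.std(axis=0) + 1e-12)
    order = np.argsort(-score)
    return lambda A, k: A[:, order[:k]]

def predict_1nn(Z_train, y_train, Z_test):
    """Label each test row with the class of its nearest training row (Euclidean)."""
    d = ((Z_test[:, None, :] - Z_train[None, :, :]) ** 2).sum(axis=2)
    return y_train[d.argmin(axis=1)]

def select_k(fit, X, y, candidates, n_folds=5, seed=0):
    """Pick the candidate k with the best cross-validated 1-NN accuracy."""
    folds = np.array_split(np.random.default_rng(seed).permutation(len(y)), n_folds)
    correct = np.zeros(len(candidates))
    for held_out in folds:
        train = np.setdiff1d(np.arange(len(y)), held_out)
        reduce = fit(X[train], y[train])  # fit the reducer on the training folds only
        for i, k in enumerate(candidates):
            pred = predict_1nn(reduce(X[train], k), y[train], reduce(X[held_out], k))
            correct[i] += np.sum(pred == y[held_out])
    return candidates[int(np.argmax(correct))]

def fused_predict(X_train, y_train, X_test, candidates,
                  fitters=(fit_pca, fit_pls, fit_filter)):
    """Tune k separately per method, then fuse the 1-NN classifiers by majority vote.
    Labels are assumed to be non-negative integers (required by np.bincount)."""
    chosen, votes = [], []
    for fit in fitters:
        k = select_k(fit, X_train, y_train, candidates)
        reduce = fit(X_train, y_train)
        votes.append(predict_1nn(reduce(X_train, k), y_train, reduce(X_test, k)))
        chosen.append(k)
    fused = np.array([np.bincount(col).argmax() for col in np.array(votes).T])
    return fused, chosen

# Synthetic two-class data: 80 samples, 200 features, 5 of them informative.
rng = np.random.default_rng(42)
y = np.repeat([0, 1], 40)
X = rng.normal(size=(80, 200))
X[:, :5] += 2.0 * y[:, None]
idx = rng.permutation(80)
train, test = idx[:60], idx[60:]
candidates = [1, 2, 5, 10]
pred, ks = fused_predict(X[train], y[train], X[test], candidates)
accuracy = float(np.mean(pred == y[test]))
```

In this sketch `ks` holds one selected dimensionality per method, which is the point of the approach: each reducer is allowed its own number of dimensions instead of sharing a single value. Feature fusion, which the abstract reports as less successful under per-method tuning, would instead concatenate the reduced representations before running a single nearest neighbor classifier.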
Keywords
single dimensionality reduction method,nearest neighbor classification,new classifier fusion method,dimensionality reduction method,previous method,novel feature fusion method,classifier fusion,fusion method,reduced feature,improving fusion,dimensionality reduction methods,dimensionality reduction,microarrays,principal component analysis,information gain,partial least squares,data mining,cancer,high dimensional data,accuracy,three dimensional,data reduction