Gene Expression Data Classification Using Locality Preserving Projections

Houqin Bian, Ronal Chung

Bioinformatics and Bioengineering(2011)

引用 1|浏览1
暂无评分
摘要
Classification analysis of gene expression data could lead to knowledge of gene functions and diseases mechanisms. However, the data involve nonlinear interactions among genes and environmental factors. Worst yet, while the data are usually of high dimensions, the sample sizes acquirable are generally relatively small, resulting in the well known difficulty - the curse of dimensionality - in the classification task. This work describes how gene expression data can be analyzed using Locality Preserving Projections (LPP) manifold learning method. LPP is a dimensionality reduction strategy for feature selection and visualization. Using LPP, the high dimensional gene expression data are mapped to a low dimensional subspace for data analysis. LPP finds the optimal linear approximations to the eigenfunctions of the Laplace Beltrami operator on the manifold. Not only does it share many convenient data-representation properties of the nonlinear techniques like Laplacian Eigenmaps or Locally Linear Embedding, it is also linear and more crucially is defined everywhere in the ambient space rather than just on the training data points. Comparative experimental results with PC A, LDA, LLE, etc. on different gene expression datasets show that the LPP-based method has the potential of being more efficient for complex gene expression data classification.
更多
查看译文
关键词
different gene expression datasets,training data point,feature visualization,locally linear embedding,laplace equations,microarray data analysis,local preserving projection,gene function,dimensionality reduction strategy,gene functions,optimal linear approximations,genetics,classification analysis,gene expression,data classification,laplacian eigenmaps,disease mechanisms,data analysis,laplace beltrami operator,eigenfunctions,data-representation properties,complex gene expression data,environmental factors,genetic algorithms,gene expression data,locality preserving projections,lpp-based method,classification,feature selection,gene expression data classification,eigenvalues and eigenfunctions,nonlinear interactions,bioinformatics,high dimensional gene expression,classification task,low-dimensional subspace,sample size,curse of dimensionality,linear approximation,accuracy,data representation,manifold learning,principal component analysis,symmetric matrices,manifolds,feature extraction,general relativity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要