The BAHSIC family of gene selection algorithms
mag(2013)
摘要
Identifying significant genes among thousands of sequences on a microarray is a central challenge for cancer research in bioinformatics. A multitude of methods have been proposed for this task of feature selection, yet the selected gene lists differ greatly between different methods. To accomplish biologically meaningful gene selection from microarray data, we have to understand the theoretical connections and the differences between these methods. In this article, we define a kernelbased framework for feature selection based on the Hilbert-Schmidt Independence Criterion and backward elimination, called BAHSIC. We show that several well-known feature selectors are instances of BAHSIC, thereby clarifying their relationship. Furthermore, by choosing a different kernel, BAHSIC allows us to easily define novel feature selection algorithms. As a further advantage, feature selection via BAHSIC works directly on multiclass problems. In a broad experimental evaluation, the members of the BAHSIC family reach high levels of accuracy and robustness when compared to other feature selection techniques.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络