Using Classification and Visualization on Pattern Databases for Gene Expression Data Analysis

PaRMa(2004)

Abstract
We are designing new data mining techniques for gene expression data, more precisely inductive querying techniques that extract a priori interesting bi-sets, i.e., sets of objects (or biological situations) and associated sets of attributes (or genes). The so-called (formal) concepts are important special cases of a priori interesting bi-sets in derived boolean expression matrices, e.g., matrices that encode over-expression of genes. It has been shown recently that the extraction of every concept is often possible from typical gene expression data because the number of biological situations is generally quite small (a few tens). In specific applications, we have been able to extract every concept, which can amount to millions of concepts. Obviously, post-processing these huge volumes of patterns for the discovery of biologically relevant information is challenging. It is nonetheless worthwhile, since the added value of transcription module discovery is very high and formal concepts can be seen as putative transcription modules. We describe our ongoing research on concept post-processing by means of classification and visualization. It has been applied to a real-life gene expression data set, with promising feedback from end-users.
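To make the notion of a formal concept concrete, the following is a minimal sketch, not the authors' extraction algorithm. It assumes a hypothetical toy expression matrix and a hypothetical over-expression threshold of 1.0, and enumerates concepts by brute force over subsets of situations, which is only feasible because the number of situations is small.

```python
from itertools import combinations

def binarize(expr, threshold):
    """Encode over-expression: 1 where the value exceeds the (assumed) threshold."""
    return [[1 if v > threshold else 0 for v in row] for row in expr]

def formal_concepts(matrix):
    """Return every (extent, intent) pair of a boolean matrix.

    Rows are objects (biological situations), columns are attributes (genes).
    Brute force: close each subset of rows, so cost grows as 2**len(matrix).
    """
    n_obj = len(matrix)
    n_attr = len(matrix[0]) if matrix else 0
    concepts = set()
    for size in range(n_obj + 1):
        for objs in combinations(range(n_obj), size):
            # Intent: genes over-expressed in all chosen situations.
            intent = frozenset(
                a for a in range(n_attr) if all(matrix[o][a] for o in objs)
            )
            # Extent: all situations in which every gene of the intent is over-expressed.
            extent = frozenset(
                o for o in range(n_obj) if all(matrix[o][a] for a in intent)
            )
            concepts.add((extent, intent))
    return concepts

# Hypothetical toy data: 3 situations x 3 genes (values are illustrative only).
expr = [
    [2.1, 1.8, 0.3],
    [1.9, 2.5, 1.7],
    [0.2, 1.6, 2.2],
]
boolean_matrix = binarize(expr, threshold=1.0)
concepts = formal_concepts(boolean_matrix)

for extent, intent in sorted(concepts, key=lambda c: (len(c[0]), sorted(c[0]))):
    print(sorted(extent), sorted(intent))
```

Each pair is maximal: no situation can be added to the extent, and no gene to the intent, without breaking the all-ones rectangle. This is what makes a concept readable as a putative transcription module.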
Keywords
data mining