An evolutionary algorithm for Feature Selective Double Clustering of text documents

IEEE Congress on Evolutionary Computation(2013)

引用 2|浏览7
暂无评分
摘要
We propose FSDC, an evolutionary algorithm for Feature Selective Double Clustering of text documents. We first cluster the terms existing in the document corpus. The term clusters are then fed into multiobjective genetic algorithms to prune non-informative terms and form sets of keyterms representing topics. Based on the topic keyterms found, representative documents for each topic are extracted. These documents are then used as seeds to cluster all documents in the dataset. FSDC is compared to some well-known co-clusterers on real text datasets. The experimental results show that our algorithm can outperform the competitors.
更多
查看译文
关键词
feature extraction,genetic algorithms,pattern clustering,text analysis,FSDC,document corpus,evolutionary algorithm,feature selective double clustering,multiobjective genetic algorithms,noninformative term pruning,real text datasets,representative document extraction,text documents,topic keyterm set,Genetic algorithm,co-clustering,multiobjective optimization,text clustering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要