A New Graph-Based Algorithm for Clustering Documents

Pisa (2008)

Abstract
This paper presents CStar, a new algorithm for document clustering. CStar improves on recently developed algorithms such as Generalized Star (GStar) and ACONS, which were originally proposed to reduce some drawbacks of earlier Star-like algorithms. CStar uses the Condensed Star-shaped Sub-graph concept defined by ACONS, but introduces a new heuristic that constructs a new cover of the thresholded similarity graph and reduces the drawbacks of GStar and ACONS. Experiments on standard document collections show that the proposal outperforms previously defined algorithms and other related algorithms used for document clustering.
Keywords
document clustering, CStar algorithm, standard document collection, new heuristic, previous Star-like algorithm, new graph-based algorithm, clustering documents, new cover, new algorithm, Condensed Star-shaped Sub-graph concept, related algorithm, ACONS algorithm, graph theory, text mining, optics, clustering, algorithm design and analysis, astrophysics, redundancy, data mining, clustering algorithms
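
The abstract refers to a thresholded similarity graph and to covering it with star-shaped sub-graphs. As a rough illustration of those two ideas only, the sketch below builds such a graph from bag-of-words cosine similarity and covers it with the classic greedy Star heuristic (highest-degree uncovered vertex becomes a center, its neighbors become satellites). This is not the CStar heuristic, which the abstract does not specify. The function names, the whitespace tokenizer, the threshold name `beta`, and the sample documents are all assumptions made for this example.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two term-frequency Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def thresholded_graph(docs, beta):
    """Adjacency sets: documents i and j are linked when similarity >= beta.

    Assumption: documents are tokenized by lowercasing and splitting on
    whitespace; a real system would likely use TF-IDF weights instead.
    """
    vecs = [Counter(d.lower().split()) for d in docs]
    adj = {i: set() for i in range(len(vecs))}
    for i in range(len(vecs)):
        for j in range(i + 1, len(vecs)):
            if cosine(vecs[i], vecs[j]) >= beta:
                adj[i].add(j)
                adj[j].add(i)
    return adj


def greedy_star_cover(adj):
    """Classic greedy Star cover (illustrative; NOT the CStar heuristic).

    Visit vertices by decreasing degree (ties broken by index). Each vertex
    that is still uncovered becomes a star center; the cluster is the center
    plus all of its neighbors, so clusters may overlap.
    """
    covered = set()
    clusters = []
    for v in sorted(adj, key=lambda v: (-len(adj[v]), v)):
        if v in covered:
            continue
        cluster = {v} | adj[v]
        clusters.append((v, cluster))
        covered |= cluster
    return clusters


# Hypothetical sample documents, used only to exercise the sketch.
docs = [
    "graph clustering algorithm",
    "graph clustering method",
    "star cover graph",
    "cooking pasta recipe",
]
adj = thresholded_graph(docs, beta=0.5)
clusters = greedy_star_cover(adj)
```

With these sample documents and `beta=0.5`, only the first two documents are similar enough to share an edge, so they form one star and the other two documents become singleton clusters. Raising `beta` makes the graph sparser and the cover finer.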