Automatic extraction of clusters from hierarchical clustering representations

PAKDD(2003)

引用 194|浏览37
暂无评分
摘要
Hierarchical clustering algorithms are typically more effective in detecting the true clustering structure of a data set than partitioning algorithms. However, hierarchical clustering algorithms do not actually create clusters, but compute only a hierarchical representation of the data set. This makes them unsuitable as an automatic pre-processing step for other algorithms that operate on detected clusters. This is true for both dendrograms and reachability plots, which have been proposed as hierarchical clustering representations, and which have different advantages and disadvantages. In this paper we first investigate the relation between dendrograms and reachability plots and introduce methods to convert them into each other showing that they essentially contain the same information. Based on reachability plots, we then introduce a technique that automatically determines the significant clusters in a hierarchical cluster representation. This makes it for the first time possible to use hierarchical clustering as an automatic pre-processing step that requires no user interaction to select clusters from a hierarchical cluster representation.
更多
查看译文
关键词
automatic pre-processing step,reachability plot,automatic extraction,true clustering structure,hierarchical clustering algorithm,hierarchical representation,hierarchical clustering,significant cluster,hierarchical clustering representation,hierarchical cluster representation,optics,dendrogram
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要