An Instantiation of Hierarchical Distance-Based Conceptual Clustering for Propositional Learning

Pacific-Asia Conference on Knowledge Discovery and Data Mining (2009)

Abstract
In this work we analyse the relationship between distance and generalisation operators for real numbers, nominal data and tuples in the context of hierarchical distance-based conceptual clustering (HDCC). HDCC is a general approach to conceptual clustering that extends the traditional algorithm for hierarchical clustering by producing conceptual generalisations of the discovered clusters. This makes it possible to combine the flexibility of changing the distance to suit different clustering problems with the advantage of having concepts, which are crucial for tasks such as summarisation and descriptive data mining in general. We propose a set of generalisation operators and distances for the data types mentioned above, and we analyse the properties they satisfy on the basis of three different levels of agreement between the clustering hierarchy obtained from the linkage distance and the hierarchy obtained by using generalisation operators.
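The idea of pairing each merge in a hierarchical clustering with a conceptual generalisation can be illustrated with a minimal sketch for real numbers. Everything here is an assumption made for illustration and is not taken from the paper: the absolute difference as distance, single linkage as the linkage distance, the smallest enclosing interval as the generalisation operator, and the function names `single_linkage`, `generalise` and `hdcc_sketch`.

```python
from itertools import combinations


def single_linkage(a, b):
    """Linkage distance between two clusters: the smallest absolute
    difference between a member of `a` and a member of `b` (assumed)."""
    return min(abs(x - y) for x in a for y in b)


def generalise(cluster):
    """Assumed generalisation operator for real numbers: the smallest
    closed interval (low, high) covering every element of the cluster."""
    return (min(cluster), max(cluster))


def hdcc_sketch(points):
    """Agglomerative clustering that records, for every merge, the
    linkage distance and the generalisation of the merged cluster."""
    clusters = [[p] for p in points]
    merges = []
    while len(clusters) > 1:
        # Pick the pair of clusters with the smallest linkage distance.
        i, j = min(
            combinations(range(len(clusters)), 2),
            key=lambda ij: single_linkage(clusters[ij[0]], clusters[ij[1]]),
        )
        dist = single_linkage(clusters[i], clusters[j])
        merged = clusters[i] + clusters[j]
        merges.append((dist, generalise(merged)))
        # Replace the two merged clusters with their union.
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges


if __name__ == "__main__":
    for dist, pattern in hdcc_sketch([1.0, 2.0, 10.0, 11.5]):
        print(f"merged at distance {dist}: pattern {pattern}")
```

In this toy setting each interval covers exactly the points of its cluster and nothing else, so the hierarchy of patterns mirrors the hierarchy produced by the linkage distance. For other data types or linkage distances the two hierarchies may disagree, which is the kind of mismatch the levels of agreement in the abstract are meant to characterise.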
Keywords
generalisation operator, conceptual generalisations, hierarchical distance-based conceptual clustering, conceptual clustering, propositional learning, hierarchical clustering, descriptive data mining, clustering hierarchy, clustering problem, distances, data type, nominal data, generalisation, satisfiability, data mining