K-Fact - Using the Frequency Factor for Clustering Categorical Data.

BRACIS(2019)

引用 0|浏览9
暂无评分
摘要
K-modes, an extension of K-means, aims to cluster categorical data by using the mode to update clustersu0027 centroids, as well an overlap measure to define the distance between objects and centers. This work presents K-fact, an extension of K-modes, which proposes the frequency factor, a new probability-based measure to update the centroids, and the use of several similarity measures suitable to categorical values. In our validity scheme, we ran both K-modes and K-fact over real-world and synthetic datasets in order to compare them with an index from the external validity criteria. K-modes is always outperformed by some parameter set used in K-fact, especially on datasets with higher or moderate variance.
更多
查看译文
关键词
K-fact,K-modes,clustering,categorical data
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要