Kernel MDL to Determine the Number of Clusters

MACHINE LEARNING AND DATA MINING IN PATTERN RECOGNITION, PROCEEDINGS(2007)

引用 29|浏览0
暂无评分
摘要
In this paper we propose a new criterion, based on Minimum Description Length (MDL), to estimate an optimal number of clusters. This criterion, called Kernel MDL (KMDL), is particularly adapted to the use of kernel K-means clustering algorithm. Its formulation is based on the definition of MDL derived for Gaussian Mixture Model (GMM). We demonstrate the efficiency of our approach on both synthetic data and real data such as SPOT5 satellite images.
更多
查看译文
关键词
spot5 satellite image,gaussian mixture model,minimum description length,kernel mdl,optimal number,new criterion,synthetic data,k means clustering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要