Identifying stable objects for accelerating the classification phase of k-means.

ADVANCES ON P2P, PARALLEL, GRID, CLOUD AND INTERNET COMPUTING(2017)

引用 1|浏览6
暂无评分
摘要
This work presents an improved version of the K-Means algorithm, this version consists in a simple heuristic where objects that remains in the same group, between the current and the previous iteration, are identified and excluded from calculi in the classification phase for subsequent iterations. In order to evaluate the improved version versus the standard, three synthetic and seven well-known real instances of specialized literature were used. Experimental results showed that the proposed heuristic spends less time than the standard algorithm. The best result was obtained when the Transactions instance was grouped into 200 clusters, achieving a time reduction of 90.1% regarding the standard version, with only a grouping quality reduction of 3.97%.
更多
查看译文
关键词
Execution Time, Synthetic Dataset, Subsequent Iteration, Classification Phase, Pattern Recognition Letter
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要