Nearly Optimal Dynamic $k$-Means Clustering for High-Dimensional Data.

arXiv: Data Structures and Algorithms(2018)

引用 23|浏览90
暂无评分
摘要
We consider the $k$-means clustering problem in the dynamic streaming setting, where points from a discrete Euclidean space ${1, 2, ldots, Delta}^d$ can be dynamically inserted to or deleted from the dataset. For this problem, we provide a one-pass coreset construction algorithm using space $tilde{O}(kcdot mathrm{poly}(d, logDelta))$, where $k$ is the target number of centers. To our knowledge, this is the first dynamic geometric data stream algorithm for $k$-means using space polynomial in dimension and nearly optimal (linear) in $k$.
更多
查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要