Exponentially Consistent Nonparametric Clustering of Data Streams with Composite Distributions
ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2024)
摘要
This paper focuses on nonparametric clustering of data streams generated from unknown distributions. Existing results on exponentially consistent nonparametric clustering assume that the maximum intra-cluster distance (d
L
) is smaller than the minimum inter-cluster distance (d
H
). We show that exponential consistency can be achieved for single linkage-based (SLINK) clustering under a less strict assumption, d
I
< d
H
, where d
I
is the maximum intra-cluster nearest neighbour distance. Note that d
I
< d
L
in general. Then, we propose a sequential clustering algorithm based on SLINK. Simulation results show that the sequential SLINK algorithm requires fewer expected number of samples than the fixed-sample size SLINK algorithm for the same probability of error. We also identify examples where k-medoids clustering is unable to find the true clusters, but SLINK is exponentially consistent.
更多查看译文
关键词
Nonparametric detection,clustering,consistency,sequential detection,linkage-based clustering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要