LDA classifier monitoring in distributed streaming systems.

Journal of Parallel and Distributed Computing(2019)

引用 9|浏览50
暂无评分
摘要
An important problem in real systems for mining data streams is to detect changes in the dynamic model describing the temporal data. Such changes indicate that the underlying data has undergone a transition which may well require attention. A distributed setting poses one of the main challenges in this type of change detection. In a distributed setting, model training requires centralizing the data from all nodes (hereafter, synchronization), which is very costly in terms of communication. In order to minimize communication, a monitoring algorithm should be executed locally at each node, while preserving the validity of the global model (that is, the model that will be computed if a synchronization takes place). To achieve this goal, we propose the first communication-efficient algorithm for monitoring a classification model over distributed, dynamic data streams. While the approach is general, here we concentrate on Linear Discriminant Analysis (LDA), a popular method for classification and dimensionality reduction in many fields. We mainly apply tools from the realms of linear algebra and multi-variate analysis in order to solve the problem at hand. The resulting implementation is quite straightforward.
更多
查看译文
关键词
Distributed monitoring,Linear discriminant analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要