DisPatch: Distributed Pattern Matching over Streaming Time Series

2018 IEEE International Conference on Big Data (Big Data) (2018)

Abstract
Matching a dictionary of patterns (i.e., subsequences) against a streaming time series to identify occurrences is a primary component of real-time monitoring systems such as earthquake monitoring, power consumption monitoring, and patient monitoring. These domains depend critically on timely alarms raised immediately after an event (e.g., an earthquake, fire, or seizure) starts. Until now, the problem has been solved independently through smart pruning, efficient approximation, and pattern indexing, without bounding the delay between a pattern's occurrence and its detection. Moreover, the complexity of the dictionary matching problem grows quickly with larger dictionaries, faster data streams, and stricter delay requirements, pushing existing pattern matching systems to their limits. In this paper, we describe a robust distributed matching system, called DisPatch (Distributed Pattern Matching), that matches a pattern with a guaranteed maximum delay after the pattern appears in the stream. We develop and evaluate a novel distribution strategy and integrate state-of-the-art algorithmic optimization techniques to scale horizontally to high data rates and large dictionary sizes. We show three use cases of DisPatch in seismic, patient, and power consumption monitoring.
Keywords
dictionary matching, distribution strategy, robust distributed matching system, patient monitoring, earthquake monitoring, real-time monitoring systems, streaming time series, Distributed Pattern Matching, DisPatch, power consumption monitoring