Scalable Distributed Change Detection from Astronomy Data Streams Using Local, Asynchronous Eigen Monitoring Algorithms

SDM(2009)

引用 31|浏览86
暂无评分
摘要
This paper considers the problem of change detection using lo- cal distributed eigen monitoring algorithms for next generation of astronomy petascale data pipelines such as the Large Synop- tic Survey Telescopes (LSST). This telescope will take repeat im- ages of the night sky every 20 seconds, thereby generating 30 ter- abytes of calibrated imagery every night that will need to be co- analyzed with other astronomical data stored at different locations around the world. Change point detection and event classification in such data sets may provide useful insights to unique astronom- ical phenomenon displaying astrophysically significant variations: quasars, supernovae, variable stars, and potentially hazardous aster- oids. However, performing such data mining tasks is a challenging problem for such high-throughput distributed data streams. In this paper we propose a highly scalable and distributed asynchronous algorithm for monitoring the principal components (PC) of such dynamic data streams. We demonstrate the algorithm on a large set of distributed astronomical data to accomplish well-known astron- omy tasks such as measuring variations in the fundamental plane of galaxy parameters. The proposed algorithm is provably correct (i.e. converges to the correct PCs without centralizing any data) and can seamlessly handle changes to the data or the network. Real exper- iments performed on Sloan Digital Sky Survey (SDSS) catalogue data show the effectiveness of the algorithm.
更多
查看译文
关键词
fundamental plane,change detection,principal component,variable stars,data mining,dynamic data,change point detection,high throughput
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要