首页> 外文会议>SIAM International Conference on Data Mining >Scalable Distributed Change Detection from Astronomy Data Streams using Local, Asynchronous Eigen Monitoring Algorithms
【24h】

Scalable Distributed Change Detection from Astronomy Data Streams using Local, Asynchronous Eigen Monitoring Algorithms

机译:使用本地异步eIGEN监测算法从天文数据流中可扩展的分布式变化检测

获取原文

摘要

This paper considers the problem of change detection using local distributed eigen monitoring algorithms for next generation of astronomy petascale data pipelines such as the Large Synoptic Survey Telescopes (LSST). This telescope will take repeat images of the night sky every 20 seconds, thereby generating 30 terabytes of calibrated imagery every night that will need to be coanalyzed with other astronomical data stored at different locations around the world. Change point detection and event classification in such data sets may provide useful insights to unique astronomical phenomenon displaying astrophysically significant variations: quasars, supernovae, variable stars, and potentially hazardous asteroids. However, performing such data mining tasks is a challenging problem for such high-throughput distributed data streams. In this paper we propose a highly scalable and distributed asynchronous algorithm for monitoring the principal components (PC) of such dynamic data streams. We demonstrate the algorithm on a large set of distributed astronomical data to accomplish well-known astronomy tasks such as measuring variations in the fundamental plane of galaxy parameters. The proposed algorithm is provably correct (i.e. converges to the correct PCs without centralizing any data) and can seamlessly handle changes to the data or the network. Real experiments performed on Sloan Digital Sky Survey (SDSS) catalogue data show the effectiveness of the algorithm.
机译:本文考虑了使用本地分布式特征监测算法的变化检测问题,用于下一代天文学曲叶数据管道,如大型舞蹈调查望远镜(LSST)。这台望远镜每隔20秒就重复夜空的重复图像,从而每天晚上生成30岁的校准图像,需要与世界各地的不同地点的其他天文数据共同化。在这种数据集中改变点检测和事件分类可以提供对展示天文性显着变化的独特天文现象的有用的见解:Quasars,Supernovae,可变星星和潜在危险的小行星。然而,执行此类数据挖掘任务是这种高吞吐量分布式数据流的具有挑战性问题。在本文中,我们提出了一种高度可扩展和分布的异步算法,用于监视此类动态数据流的主要组件(PC)。我们展示了大量分布式天文数据的算法,以实现众所周知的天文任务,例如测量Galaxy参数的基本平面中的变化。所提出的算法可证明是正确的(即,在没有集中任何数据的情况下收敛到正确的PC),并且可以无缝地处理对数据或网络的改变。在斯隆数字天空调查(SDSS)目录数据上进行的真实实验显示了算法的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号