【24h】

Incremental and Parallel Analytics on Astrophysical Data Streams

机译:天体物理数据流上的增量和并行分析

获取原文

摘要

Stream processing methods and online algorithms are increasingly appealing in the scientific and large-scale data management communities due to increasing ingestion rates of scientific instruments, the ability to produce and inspect results interactively, and the simplicity and efficiency of sequential storage access over enormous datasets. This article will showcase our experiences in using off-the-shelf streaming technology to implement incremental and parallel spectral analysis of galaxies from the Sloan Digital Sky Survey (SDSS) to detect a wide variety of galaxy features. The technical focus of the article is on a robust, highly scalable principal components analysis (PCA) algorithm and its use of coordination primitives to realize consistency as part of parallel execution. Our algorithm and framework can be readily used in other domains.
机译:流处理方法和在线算法在科学和大规模数据管理社区中越来越多地吸引了科学仪器的摄取率,能够以交互方式产生和检查结果的能力,以及在巨大数据集中的顺序存储访问的简单性和效率。本文将展示我们使用现成流媒体技术的经验,实现来自斯隆数字天空调查(SDSS)的星系的增量和并行谱分析,以检测各种银河特征。文章的技术焦点是一个强大的,高度可扩展的主成分分析(PCA)算法及其使用协调原语来实现一致性的并行执行的一部分。我们的算法和框架可以很容易地用于其他域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号