首页> 外文会议>Conference on software and cyberinfrastructure for astronomy IV >The HARPS-N archive through a Cassandra, NoSQL database suite?
【24h】

The HARPS-N archive through a Cassandra, NoSQL database suite?

机译:通过Cassandra NoSQL数据库套件进行HARPS-N归档?

获取原文

摘要

The TNG-INAF is developing the science archive for the WEAVE instrument. The underlying architecture of the archive is based on a non relational database, more precisely, on Apache Cassandra cluster, which uses a NoSQL technology. In order to test and validate the use of this architecture, we created a local archive which we populated with all the HARPS-N spectra collected at the TNG since the instrument's start of operations in mid-2012, as well as developed tools for the analysis of this data set. The HARPS-N data set is two orders of magnitude smaller than WEAVE, but we want to demonstrate the ability to walk through a complete data set and produce scientific output, as valuable as that produced by an ordinary pipeline, though without accessing directly the FITS files. The analytics is done by Apache Solr and Spark and on a relational PostgreSQL database. As an example, we produce observables like metallicity indexes for the targets in the archive and compare the results with the ones coming from the HARPS-N regular data reduction software. The aim of this experiment is to explore the viability of a high availability cluster and distributed NoSQL database as a platform for complex scientific analytics on a large data set, which will then be ported to the WEAVE Archive System (WAS) which we are developing for the WEAVE multi object, fiber spectrograph.
机译:TNG-INAF正在为WEAVE仪器开发科学档案。存档的基础架构基于非关系数据库,更确切地说,基于使用NoSQL技术的Apache Cassandra集群。为了测试和验证这种架构的使用,我们创建了一个本地档案,其中填充了自TNG在2012年中期开始运行以来在TNG收集的所有HARPS-N光谱,以及开发的分析工具该数据集。 HARPS-N数据集比WEAVE小两个数量级,但我们想证明具有遍历完整数据集并产生科学输出的能力,与普通管道产生的输出一样有价值,尽管无需直接访问FITS文件。该分析由Apache Solr和Spark以及关系PostgreSQL数据库完成。例如,我们为存档中的目标生成可观察性(如金属度指数),并将结果与​​HARPS-N常规数据归约软件得出的结果进行比较。本实验的目的是探索高可用性集群和分布式NoSQL数据库的可行性,以此作为对大数据集进行复杂科学分析的平台,然后将其移植到我们正在开发的WEAVE存档系统(WAS)中WEAVE多对象光纤光谱仪。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号