Journal: EPJ Web of Conferences

Bibliography, catalogs, pixel data: Management of heterogeneous Big Data at CDS by the documentalists

Abstract

High-speed Internet and increasingly cost-effective data storage have changed the way data are managed today. Large amounts of heterogeneous data can now be visualized easily and rapidly using interactive applications such as “Google Maps”. In this respect, the Hierarchical Progressive Survey (HiPS) method has been developed by the Centre de Données astronomiques de Strasbourg (CDS) since 2009. HiPS uses the hierarchical sky tessellation called HEALPix to describe and organize images, data cubes or source catalogs. These HiPS can be accessed and visualized using applications such as Aladin. We show that structuring the data using HiPS enables easy and quick access to large and complex sets of astronomical data. As with bibliographic and catalog data, full documentation and comprehensive metadata are required for pertinent use of these data. Hence the role of documentalists in the process of producing HiPS is essential. We present the interaction between documentalists and the other specialists who are all part of the CDS team and support this process. More precisely, we describe the tools used by the documentalists to generate HiPS or to update the Virtual Observatory standardized descriptive information (the “metadata”). We also present the challenges faced by the documentalists in processing such heterogeneous data on scales from megabytes up to petabytes. On the one hand, documentalists at CDS manage small textual or numerical data sets for one or a few astronomical objects. On the other hand, they process large data sets such as big catalogs containing heterogeneous data like spectra, images or data cubes, for millions of astronomical objects. Finally, by participating in the development of interactive visualization of images or three-dimensional data cubes using the HiPS method, documentalists contribute to the long-term management of large, complex astronomical data.
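
To illustrate how a HEALPix-based hierarchy can organize data, the following minimal Python sketch computes the number of HEALPix cells at a given order and the relative path of a tile. It assumes the `Norder/Dir/Npix` directory layout described in the IVOA HiPS recommendation; the function names are hypothetical and do not correspond to any CDS tool mentioned in the abstract.

```python
def healpix_cell_count(order: int) -> int:
    """Number of HEALPix cells covering the whole sky at a given order.

    The sphere is split into 12 base cells, and each order subdivides
    every cell into 4, giving 12 * 4**order cells.
    """
    if order < 0:
        raise ValueError("order must be non-negative")
    return 12 * 4 ** order


def hips_tile_path(order: int, npix: int, ext: str = "fits") -> str:
    """Relative path of one tile (hypothetical helper).

    Assumed layout: Norder{K}/Dir{D}/Npix{N}.{ext}, where D groups
    tiles in blocks of 10000 so that no directory grows too large.
    """
    if not 0 <= npix < healpix_cell_count(order):
        raise ValueError("tile index out of range for this order")
    directory = (npix // 10000) * 10000
    return f"Norder{order}/Dir{directory}/Npix{npix}.{ext}"


if __name__ == "__main__":
    print(healpix_cell_count(3))
    print(hips_tile_path(3, 448, "jpg"))
```

Because each order quadruples the number of cells, a viewer only needs to fetch the few tiles that cover the current field of view at the current zoom level, which is what makes the "Google Maps"-style browsing described above practical for very large surveys.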