Conference: IEEE International Conference on Cluster Computing
LVFS: A scalable big data scientific storage system

Abstract

LVFS is a scalable virtual file storage system developed for a class of scientific data systems that keep accumulating data over time until, at petabyte scale, the volume begins to seriously degrade response times for user requests. The system is operational in a real use case, the NASA MODIS Adaptive Processing System (MODAPS), where it has been shown to double data throughput compared to the original system thanks to better performance and easier load balancing. MODAPS has now been in operation for more than a decade and holds over four petabytes of data in billions of files on more than 500 disks attached to multiple storage nodes. MODAPS is the processing system that delivers calibrated Level 1 data from the MODIS instruments on two NASA satellites launched over a decade ago; each instrument records 36 multispectral channels in the visible and infrared. This life cycle is typical of many scientific instruments and experiments, which continue to generate useful archival data well beyond their original expected lifetimes and must still meet current scientific user needs. The Level 1 Atmosphere Archive and Distribution System (LAADS) is responsible for distributing the products produced by MODAPS. The LAADS Virtual File System (LVFS) has now replaced parts of LAADS and is responsible for the read-only distribution of all LAADS data to the public. In this paper, we describe the unique design of LVFS and our ongoing work to incorporate a distributed hash-based architecture into that design, transforming LVFS into a full scientific storage architecture scalable to exabyte sizes.
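The abstract does not say how the distributed hash-based architecture places files, so the following is only a minimal sketch of one common technique, consistent hashing, and not the LVFS design itself. The class name `HashRing`, the node names, the file paths, and the `replicas` parameter are all hypothetical and chosen purely for illustration.

```python
import bisect
import hashlib


def _hash(key: str) -> int:
    """Map a string to a 64-bit integer with SHA-256 (stable across runs)."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


class HashRing:
    """Illustrative consistent-hash ring (an assumption, not the LVFS code).

    Each storage node owns several points on a 64-bit ring; a file is
    served by the node that owns the first point after the file's hash.
    """

    def __init__(self, nodes, replicas=64):
        self._replicas = replicas  # ring points per node
        self._points = []          # sorted ring positions
        self._owner = {}           # ring position -> node name
        for node in nodes:
            self.add(node)

    def add(self, node):
        """Place `replicas` points for `node` on the ring."""
        for i in range(self._replicas):
            point = _hash(f"{node}#{i}")
            if point not in self._owner:
                bisect.insort(self._points, point)
            self._owner[point] = node

    def locate(self, path):
        """Return the node responsible for `path`."""
        if not self._points:
            raise LookupError("ring has no nodes")
        # First ring point strictly after the file's hash, wrapping around.
        idx = bisect.bisect_right(self._points, _hash(path)) % len(self._points)
        return self._owner[self._points[idx]]


if __name__ == "__main__":
    ring = HashRing(["node-a", "node-b", "node-c"])  # hypothetical nodes
    print(ring.locate("/archive/example_granule.hdf"))  # hypothetical path
```

The property that makes this family of schemes attractive for a growing archive is that adding a node reassigns only the files that land on the new node's ring segments; every other file keeps its current location, so capacity can be added without a full reshuffle.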