首页> 外文期刊>Operating systems review >Recovering Transient Data: Automated On-demand Data Reconstruction and Offloading for Supercomputers
【24h】

Recovering Transient Data: Automated On-demand Data Reconstruction and Offloading for Supercomputers

机译:恢复瞬态数据:超级计算机的自动按需数据重构和卸载

获取原文
获取原文并翻译 | 示例
           

摘要

It has become a national priority to build and use PetaFlop supercomputers. The dependability of such large systems has been recognized as a key issue that can impact their usability. Even with smaller, existing machines, failures are the norm rather than an exception. Research has shown that storage systems are the primary source of faults leading to supercomputer unavailability. In this paper, we envision two mechanisms, namely on-demand data reconstruction and eager data offloading, to address the availability of job input/output data. These two techniques aim to allow parallel jobs and post-job processing tools to continue execution despite storage system failures in supercomputers. Fundamental to both approaches is the definition and acquisition of recovery-related parallel file system metadata, which is then coupled with transparent remote data accesses. Our approach attempts to maximize the utilization of precious supercomputer resources by improving the accessibility of transient job data. Further, the proposed methods are best-effort in nature and complement existing file system recovery schemes, which are designed for persistent data. Several of our previous studies help in demonstrating the feasibility of the proposed approaches.
机译:构建和使用PetaFlop超级计算机已成为国家的优先事项。如此大型系统的可靠性已被认为是可能影响其可用性的关键问题。即使使用较小的现有机器,故障也是正常现象,而非例外。研究表明,存储系统是导致超级计算机不可用的故障的主要来源。在本文中,我们设想了两种机制,即按需数据重构和紧急数据卸载,以解决作业输入/输出数据的可用性。这两种技术旨在允许并行作业和作业后处理工具继续执行,即使超级计算机中的存储系统出现故障。这两种方法的基础都是定义和获取与恢复相关的并行文件系统元数据,然后将其与透明的远程数据访问结合在一起。我们的方法试图通过改善临时作业数据的可访问性来最大程度地利用宝贵的超级计算机资源。此外,所提出的方法本质上是尽力而为,并补充了为持久性数据而设计的现有文件系统恢复方案。我们之前的几项研究有助于证明所提出方法的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号