Journal: Future Generation Computer Systems

Measuring the impact of burst buffers on data-intensive scientific workflows

Abstract

Science applications frequently produce and consume large volumes of data, but moving this data to and from compute resources can be challenging because parallel file system performance is not keeping pace with compute and memory performance. To mitigate this I/O bottleneck, some systems have deployed burst buffers, but their impact on the performance of real-world scientific workflow applications is still unclear. In this paper, we examine that impact using the remote-shared, allocatable burst buffers on the Cori system at NERSC. We run two data-intensive workflows: a high-throughput genome analysis workflow, and a subset of the SCEC high-performance CyberShake workflow, which is a production seismic hazard analysis workflow. We find that burst buffers improve read and write performance by an order of magnitude, and that these improvements increase job performance and thereby overall workflow performance, even for long-running CPU-bound jobs. © 2019 Elsevier B.V. All rights reserved.
