Conference: IEEE International Parallel and Distributed Processing Symposium

Key/Value-enabled Flash Memory for Complex Scientific Workflows with On-line Analysis and Visualization


Abstract

Scientific workflows are often composed of compute-intensive simulations and data-intensive analysis and visualization, both equally important for productivity. High-performance computers run the compute-intensive phases efficiently, but data-intensive processing still receives less attention. Dense non-volatile memory integrated into supercomputers can help address this problem. In addition to density, it offers significantly finer-grained I/O than disk-based I/O systems. We present a way to exploit the fundamental capabilities of Storage-Class Memories (SCM), such as Flash, by using scalable key-value (KV) I/O methods instead of the traditional file I/O calls commonly used in HPC systems. Our objective is to enable higher performance for on-line and near-line storage for analysis and visualization of very high-resolution, but correspondingly transient, simulation results. In this paper, we describe 1) the adaptation of a scalable key-value store to a BlueGene/Q system with integrated Flash memory, 2) a novel key-value aggregation module that implements coalesced, function-shipped calls between the clients and the servers, and 3) the refactoring of a scientific workflow to use application-relevant keys for fine-grained data subsets. The resulting implementation is analogous to function-shipping of POSIX I/O calls but shows an order-of-magnitude increase in read IOPS and a 2.5x increase in write IOPS (11 million read IOPS; 2.5 million write IOPS from 4096 compute nodes) when compared to a classical file system on the same system. It represents an innovative approach for the integration of SCM within an HPC system at scale.
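
The abstract does not give the interface of the aggregation module, so the following is only an illustrative sketch of the general idea of coalescing key-value calls, not the authors' implementation. All names here (`InMemoryServer`, `CoalescingClient`, `put_batch`, the `(timestep, variable, block_id)` key layout, and routing by block id) are assumptions made for the example. The client buffers small puts per destination server and ships each buffer as one batched call, so many fine-grained writes cost few client-to-server round trips.

```python
from collections import defaultdict
from typing import Dict, List, Tuple

# Hypothetical application-relevant key: (timestep, variable name, block id).
Key = Tuple[int, str, int]
Item = Tuple[Key, bytes]


class InMemoryServer:
    """Stand-in for a key-value server; counts the calls it receives."""

    def __init__(self) -> None:
        self.store: Dict[Key, bytes] = {}
        self.calls = 0

    def put_batch(self, items: List[Item]) -> None:
        # One call carries many key-value pairs.
        self.calls += 1
        self.store.update(items)


class CoalescingClient:
    """Buffers puts per server and ships them as batched calls."""

    def __init__(self, servers: List[InMemoryServer], batch_size: int = 4) -> None:
        self.servers = servers
        self.batch_size = batch_size
        self.pending: Dict[int, List[Item]] = defaultdict(list)

    def _server_for(self, key: Key) -> int:
        # Assumed routing rule: distribute by block id (deterministic).
        return key[2] % len(self.servers)

    def put(self, key: Key, value: bytes) -> None:
        idx = self._server_for(key)
        self.pending[idx].append((key, value))
        if len(self.pending[idx]) >= self.batch_size:
            self._flush_one(idx)

    def _flush_one(self, idx: int) -> None:
        batch = self.pending.pop(idx, [])
        if batch:
            self.servers[idx].put_batch(batch)

    def flush(self) -> None:
        # Ship whatever is still buffered, one call per server.
        for idx in list(self.pending):
            self._flush_one(idx)


def demo() -> Tuple[int, int]:
    """Write 8 blocks of one variable; return (server calls, stored items)."""
    servers = [InMemoryServer() for _ in range(2)]
    client = CoalescingClient(servers, batch_size=4)
    for block in range(8):
        client.put((0, "pressure", block), bytes([block]))
    client.flush()
    return sum(s.calls for s in servers), sum(len(s.store) for s in servers)
```

In this toy setup, `demo()` stores eight values using two server calls instead of eight. A real system would also need to coalesce reads and handle failures and ordering, none of which is modeled here.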
