...
首页> 外文期刊>International journal of parallel programming >Hierarchical Read-Write Optimizations for Scientific Applications with Multi-variable Structured Datasets
【24h】

Hierarchical Read-Write Optimizations for Scientific Applications with Multi-variable Structured Datasets

机译:具有多变量结构化数据集的科学应用的分层读写优化

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Large-scale scientific applications spend a significant amount of time in reading and writing data. These simulations run on supercomputers which are archi-tected with high-bandwidth, low-latency, and complex topology interconnects. Yet, few efforts exist that fully exploit the interconnect features for I/O. MPI-IO optimizations suffer from significant network contention at large core counts making I/O a critical bottleneck at extreme scales. We propose HieRO, which leverages the fast interconnect and performs hierarchical optimizations for I/O in scientific applications with structured datasets. HieRO performs reads/writes in multiple stages using carefully chosen leader processes who invoke the MPI-IO calls. Additionally, HieRO considers the application's domain decomposition and access patterns and fully utilizes the on-chip interconnect at each multicore node. We evaluate the efficacy of our optimizations with two scientific applications, WRF and S3D, with I/O access patterns commonly used in a wide gamut of applications. We evaluate our approaches on two supercomputers, the Edison Cray XC30 and the Mira Blue Gene/Q, representing systems with diverse interconnects and parallel filesystems. We demonstrate that algorithmic changes can lead to significant improvements in parallel read/write. HieRO is able to achieve more than 40× read time improvements for WRF and achieve up to 40× read and 13× write time improvements for S3D on 524288 cores.
机译:大型科学应用程序在读取和写入数据上花费大量时间。这些模拟在具有高带宽,低延迟和复杂拓扑互连的超级计算机上运行。然而,很少有努力充分利用I / O的互连功能。 MPI-IO优化在大量核心数量时遭受重大网络争用,这使得I / O成为极端规模的关键瓶颈。我们提出了HieRO,它利用快速互连并在具有结构化数据集的科学应用中对I / O进行分层优化。 HieRO使用精挑细选的领导程序执行MPI-IO调用,分多个阶段执行读/写操作。此外,HieRO还考虑了应用程序的域分解和访问模式,并充分利用了每个多核节点上的片上互连。我们使用两种科学应用程序WRF和S3D以及广泛用于各种应用程序的I / O访问模式来评估优化的有效性。我们在两个超级计算机,即Edison Cray XC30和Mira Blue Gene / Q上评估了我们的方法,这两个系统代表了具有不同互连和并行文件系统的系统。我们证明了算法的变化可以导致并行读/写的显着改善。 HieRO能够为WRF实现40倍以上的读取时间改进,并为524288个内核上的S3D实现40倍的读取时间和13倍的写入时间改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号