International conference on high performance computing

The Pitfalls of Provisioning Exascale Networks: A Trace Replay Analysis for Understanding Communication Performance

Abstract

Data movement, both on-node memory access and off-node network communication, is considered the main performance concern for exascale. Indeed, many application traces show significant time spent in MPI calls, potentially indicating that faster networks must be provisioned for scalability. However, equating MPI time with network communication delay ignores synchronization delays and software overheads that are independent of network hardware. Drawing on point-to-point protocol details, we use architecture simulation to decompose MPI time into communication, synchronization, and software stack components. Detailed validation with Bayesian inference identifies how sensitive performance is to specific latency and bandwidth parameters for different network protocols and quantifies the associated uncertainties. Combined with trace replay, the inference shows that synchronization and MPI software stack overhead are at least as important as the network itself in determining time spent in communication routines.
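
To make the decomposition concrete, here is a minimal sketch of how the time spent in one blocking receive might be split into the three components the abstract names. It is not the paper's simulator or its Bayesian model: the latency/bandwidth cost model, the fixed per-call software overhead, and every name and number below (`NetworkParams`, `decompose_recv`, the parameter values) are assumptions chosen for illustration only.

```python
from dataclasses import dataclass


@dataclass
class NetworkParams:
    """Hypothetical cost parameters; the values used below are not from the paper."""
    latency_s: float       # one-way network latency in seconds
    bandwidth_Bps: float   # network bandwidth in bytes per second
    sw_overhead_s: float   # assumed fixed MPI software stack cost per call


def decompose_recv(recv_post_s, send_post_s, nbytes, params):
    """Split the time of one blocking receive into three assumed components.

    synchronization: time the receiver waits because the sender posted late
    network:         latency plus size/bandwidth transfer time
    software:        fixed per-call overhead, independent of network hardware
    """
    sync = max(0.0, send_post_s - recv_post_s)
    network = params.latency_s + nbytes / params.bandwidth_Bps
    software = params.sw_overhead_s
    return {
        "synchronization": sync,
        "network": network,
        "software": software,
        "total": sync + network + software,
    }


# Illustrative scenario: the receive is posted 5 microseconds before the send.
base = NetworkParams(latency_s=1e-6, bandwidth_Bps=10e9, sw_overhead_s=0.5e-6)
faster = NetworkParams(latency_s=1e-6, bandwidth_Bps=20e9, sw_overhead_s=0.5e-6)

before = decompose_recv(0.0, 5e-6, 8192, base)
after = decompose_recv(0.0, 5e-6, 8192, faster)

# Fraction of the observed MPI time removed by doubling the bandwidth.
speedup_fraction = (before["total"] - after["total"]) / before["total"]
```

In this made-up scenario the late sender dominates the observed time, so doubling the bandwidth removes only a small fraction of the total. That is the kind of pitfall the abstract warns about when MPI time is read directly as network time.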