首页> 外文会议>Proceedings of the IEEE International Conference on Cluster Computing >Optimizing mechanisms for latency tolerance in remote memory access communication on clusters
【24h】

Optimizing mechanisms for latency tolerance in remote memory access communication on clusters

机译:优化群集上远程内存访问通信中的延迟容限的机制

获取原文

摘要

This paper describes the design and implementation of mechanisms for latency tolerance in the remote memory access communication on clusters equipped with high-performance networks such as Myrinet. It discusses strategies that bridge the gap between user-level requirements and network-specific communication interfaces while attempting to increase opportunities for latency hiding. Mechanisms for overlapping communication with computation and coalescing small messages (trading latency for bandwidth) are explored. The effectiveness of these techniques is evaluated using microbenchmarks and application kernels including the NAS parallel benchmark suite. The microbenchmark results showed a better degree of overlap for nonblocking operations in ARMCI as compared to MPI. Application results showed up 30% to 45% improvement over MPI on using nonblocking operations. The aggregation of small messages yielded performance improvement of up to 78% over non-aggregated communication.
机译:本文介绍了在配备有高性能网络(如MyRinet)的群集中远程内存访问通信中延迟容差的机制的设计和实现。它讨论了展览用户级要求与网络特定通信接口之间差距的策略,同时尝试增加延迟隐藏的机会。探讨了与计算和聚结的重叠通信的机制(带宽的交易延迟)。使用包括NAS并联基准套件的微观发布和应用核来评估这些技术的有效性。与MPI相比,Microbenchmark结果表明,在ARMCI中的非阻塞操作具有更好的重叠。应用结果在使用非阻塞操作的MPI上提高了30%至45%。小消息的聚合产生了在非聚合通信中的绩效提高高达78%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号