Conference: European PVM/MPI Users' Group Meeting
MPI Support for Multi-core Architectures: Optimized Shared Memory Collectives


Abstract

With local core counts on the rise, taking advantage of shared memory to optimize collective operations can improve performance. We study several on-host shared-memory optimized algorithms for MPI_Bcast, MPI_Reduce, and MPI_Allreduce, using tree-based and reduce-scatter algorithms. For small data operations with relatively large synchronization costs, fan-in/fan-out algorithms generally perform best. For large messages, data manipulation constitutes the largest cost, and reduce-scatter algorithms are best for reductions. These optimizations improve performance by up to a factor of three. Memory and cache sharing effects require deliberate process layout and careful radix selection for tree-based methods.
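
The two algorithm families named in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the authors' implementation: it uses no MPI calls, models each rank's buffer as a Python list, and the function names `fan_in_reduce` and `reduce_scatter_allreduce` are hypothetical. It shows how the tree radix controls the number of synchronization rounds in a fan-in reduction, and how a reduce-scatter splits the data so that each rank reduces only its own slice.

```python
from typing import List, Tuple


def fan_in_reduce(buffers: List[List[int]], radix: int) -> Tuple[List[int], int]:
    """Simulated fan-in tree reduction (sum) to rank 0.

    Returns the reduced vector and the number of tree levels, which
    stands in for the number of synchronization rounds.
    """
    vals = {rank: list(buf) for rank, buf in enumerate(buffers)}
    active = list(range(len(buffers)))
    rounds = 0
    while len(active) > 1:
        next_active = []
        # Group the surviving ranks; the first rank of each group is the parent.
        for i in range(0, len(active), radix):
            group = active[i:i + radix]
            parent = group[0]
            for child in group[1:]:
                vals[parent] = [a + b for a, b in zip(vals[parent], vals[child])]
            next_active.append(parent)
        active = next_active
        rounds += 1
    return vals[active[0]], rounds


def reduce_scatter_allreduce(buffers: List[List[int]]) -> List[List[int]]:
    """Simulated allreduce (sum) built from reduce-scatter plus allgather.

    Rank r reduces only elements [bounds[r], bounds[r + 1]) across all
    input buffers, so the arithmetic is divided evenly among the ranks.
    """
    p = len(buffers)
    n = len(buffers[0])
    bounds = [r * n // p for r in range(p + 1)]
    shared = [0] * n  # stands in for a shared-memory result buffer
    # Reduce-scatter phase: each rank sums its own slice.
    for r in range(p):
        for i in range(bounds[r], bounds[r + 1]):
            shared[i] = sum(buf[i] for buf in buffers)
    # Allgather phase: every rank copies the complete result.
    return [list(shared) for _ in range(p)]


if __name__ == "__main__":
    data = [[rank + i for i in range(8)] for rank in range(4)]
    print(fan_in_reduce(data, radix=2))
    print(fan_in_reduce(data, radix=4))
    print(reduce_scatter_allreduce(data)[0])
```

In this model, a larger radix means fewer tree levels but more children handled serially by each parent, which is one way to picture why radix selection matters for small messages. For large vectors, the reduce-scatter variant spreads the element-wise work across all ranks instead of concentrating it on the parents.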
