MPI Support for Multi-core Architectures: Optimized Shared Memory Collectives

机译：MPI对多核体系结构的支持：优化的共享内存集合

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

With local core counts on the rise, taking advantage of shared-memory to optimize collective operations can improve performance. We study several on-host shared memory optimized algorithms for MPI_Bcast, MPLReduce, and MPI_Allreduce, using tree-based, and reduce-scatter algorithms. For small data operations with relatively large synchronization costs fan-in/fan-out algorithms generally perform best. For large messages data manipulation constitute the largest cost and reduce-scatter algorithms are best for reductions. These optimization improve performance by up to a factor of three. Memory and cache sharing effect require deliberate process layout and careful radix selection for tree-based methods.

机译：随着本地核心数量的增加，利用共享内存来优化集体操作可以提高性能。我们使用基于树和减少分散的算法研究了几种针对MPI_Bcast，MPLReduce和MPI_Allreduce的主机上共享内存优化算法。对于具有相对较大同步成本的小数据操作，扇入/扇出算法通常效果最佳。对于大消息，数据处理构成了最大的成本，而减少分散算法最适合于减少。这些优化将性能提高了三倍。内存和缓存共享效果需要基于树的方法进行仔细的进程布局和仔细的基数选择。

著录项

来源
《Recent Advances in Parallel Virtual Machine and Message Passing Interface》|2008年|P.130-140|共11页
会议地点 Dublin(IE)
作者
Richard L. Graham; Galen Shipman;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类分布式操作系统、并行式操作系统;
关键词
collectives; shared-memory; mpi_bcast; mpi_reduce; mpi_allreduce;

机译：集体;共享内存; mpi_bcast; mpi_reduce; mpi_allreduce;

相似文献

外文文献
中文文献
专利

1. Redesigning MPI shared memory communication for large multi-core architecture [J] . Miao Luo, Hao Wang, Jerome Vienne, Computer science . 2013,第2a3期

机译：重新设计用于大型多核体系结构的MPI共享内存通信
2. Redesigning MPI shared memory communication for large multi-core architecture [J] . Miao Luo, Hao Wang, Jerome Vienne, Computer Science - Research and Development . 2013,第2a3期

机译：重新设计用于大型多核体系结构的MPI共享内存通信
3. Architectural support for efficient message passing on shared memory multi-cores [J] . Ruben Titos-Gil, Oscar Palomar, Osman Unsal, Journal of Parallel and Distributed Computing . 2016,第sepa期

机译：对共享内存多核上有效消息传递的体系结构支持
4. MPI Support for Multi-core Architectures: Optimized Shared Memory Collectives [C] . Richard L. Graham, Galen Shipman Europen PVM/MPI Users' Group Meeting . 2008

机译：MPI支持多核架构：优化共享内存集集团
5. Optimizing multi-dimensional MPI communications on multi-core architectures. [D] . Karlsson, Christer. 2012

机译：在多核体系结构上优化多维MPI通信。
6. Performance of parallel FDTD method for shared- and distributed-memory architectures: Application tobioelectromagnetics [O] . Miguel Ruiz-Cabello N., Maksims Abaļenkovs, Luis M. Diaz Angulo, 2020

机译：共享和分布式内存架构并行FDTD方法的性能：应用脚踏电磁
7. Optimizing MPI One Sided Communication on Multi-core InfiniBand Clusters using Shared Memory Backed Windows [O] . Sreeram Potluri, Hao Wang, Vijay Dhanraj, 2013

机译：使用共享内存支持Windows优化多核InfiniBand群集上的mpI单面通信
8. MPI Support for Multi-Core Architectures: Optimized Shared Memory Collectives. [R] . Graham, R. L., Shipman, G. 2013

机译：mpI支持多核架构：优化共享内存集合。

MPI Support for Multi-core Architectures: Optimized Shared Memory Collectives

摘要

著录项

相似文献

相关主题

期刊订阅