Conference: European PVM/MPI Users' Group Meeting

Advanced Hybrid MPI/OpenMP Parallelization Paradigms for Nested Loop Algorithms onto Clusters of SMPs

Abstract

Parallelizing nested-loop algorithms onto popular multi-level parallel architectures, such as clusters of SMPs, is not trivial, since the data dependencies in the algorithm impose severe restrictions on the task decomposition that can be applied. In this paper we propose three techniques for the parallelization of such algorithms, namely pure MPI parallelization, fine-grain hybrid MPI/OpenMP parallelization and coarse-grain hybrid MPI/OpenMP parallelization. We further apply an advanced hyperplane scheduling scheme that enables pipelined execution and the overlapping of communication with useful computation, thus leading to almost full CPU utilization. We implement the three variations and perform a number of micro-kernel benchmarks to verify the intuition that the hybrid programming model could potentially exploit the characteristics of an SMP cluster more efficiently than the pure message-passing programming model. We conclude that the overall performance of each model is both application and hardware dependent, and propose some directions for improving the efficiency of the hybrid model.
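The hyperplane scheduling idea mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the kernel `a[i][j] = a[i-1][j] + a[i][j-1]`, the function names, and the rectangular tiling are all assumptions chosen for illustration, and the sketch runs serially in plain Python rather than with MPI or OpenMP. It only shows that tiles sharing the same time step `t = ti + tj` have no dependencies on one another, which is what would allow a pure MPI or hybrid code to assign them to different processes or threads and pipeline the steps.

```python
def sequential(n, m):
    """Reference result: plain nested loop with flow dependencies
    on the previous row and the previous column (hypothetical kernel)."""
    a = [[1] * m for _ in range(n)]
    for i in range(1, n):
        for j in range(1, m):
            a[i][j] = a[i - 1][j] + a[i][j - 1]
    return a


def hyperplane_schedule(tiles_i, tiles_j):
    """Group tile coordinates (ti, tj) by time step t = ti + tj.

    Tile (ti, tj) depends only on (ti-1, tj) and (ti, tj-1), which
    both belong to step t-1, so tiles within one step are independent.
    """
    steps = []
    for t in range(tiles_i + tiles_j - 1):
        step = [(ti, t - ti) for ti in range(tiles_i) if 0 <= t - ti < tiles_j]
        steps.append(step)
    return steps


def tiled_wavefront(n, m, tile):
    """Execute the same kernel tile by tile in hyperplane order."""
    a = [[1] * m for _ in range(n)]
    tiles_i = (n + tile - 1) // tile  # ceiling division
    tiles_j = (m + tile - 1) // tile
    for step in hyperplane_schedule(tiles_i, tiles_j):
        # In a parallel version, each tile of this step could be handed
        # to a different process or thread; here they run one by one.
        for ti, tj in step:
            for i in range(max(1, ti * tile), min(n, (ti + 1) * tile)):
                for j in range(max(1, tj * tile), min(m, (tj + 1) * tile)):
                    a[i][j] = a[i - 1][j] + a[i][j - 1]
    return a


if __name__ == "__main__":
    for t, step in enumerate(hyperplane_schedule(2, 3)):
        print("step", t, "tiles", step)
    print(tiled_wavefront(7, 5, 2) == sequential(7, 5))
```

The sketch leaves out the part of the scheme that overlaps communication with computation; in an actual MPI code that would presumably rely on non-blocking sends and receives of tile boundaries while the next tile is being computed.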
