首页> 外文会议>Recent Advances in Parallel Virtual Machine and Message Passing Interface >Advanced Hybrid MPI/OpenMP Parallelization Paradigms for Nested Loop Algorithms onto Clusters of SMPs
【24h】

Advanced Hybrid MPI/OpenMP Parallelization Paradigms for Nested Loop Algorithms onto Clusters of SMPs

机译:用于SMP群集上的嵌套循环算法的高级混合MPI / OpenMP并行化范例

获取原文

摘要

The parallelization process of nested-loop algorithms onto popular multi-level parallel architectures, such as clusters of SMPs, is not a trivial issue, since the existence of data dependencies in the algorithm impose severe restrictions on the task decomposition to be applied. In this paper we propose three techniques for the parallelization of such algorithms, namely pure MPI parallelization, fine-grain hybrid MPI/OpenMP parallelization and coarse-grain MPI/OpenMP parallelization. We further apply an advanced hyperplane scheduling scheme that enables pipelined execution and the overlapping of communication with useful computation, thus leading almost to full CPU utilization. We implement the three variations and perform a number of micro-kernel benchmarks to verify the intuition that the hybrid programming model could potentially exploit the characteristics of an SMP cluster more efficiently than the pure message-passing programming model. We conclude that the overall performance for each model is both application and hardware dependent, and propose some directions for the efficiency improvement of the hybrid model.
机译:嵌套循环算法在流行的多级并行体系结构(例如SMP集群)上的并行化过程并不是一个小问题,因为算法中数据依赖的存在对要应用的任务分解施加了严格的限制。在本文中,我们提出了三种用于此类算法并行化的技术,即纯MPI并行化,细粒度混合MPI / OpenMP并行化和粗粒度MPI / OpenMP并行化。我们进一步应用了高级超平面调度方案,该方案可实现流水线执行以及通信与有用计算的重叠,从而几乎导致了CPU的全部利用率。我们实现了这三种变体,并执行了许多微内核基准测试,以验证直觉性的观点,即混合编程模型比纯消息传递编程模型可以更有效地利用SMP集群的特性。我们得出的结论是,每种模型的总体性能都取决于应用程序和硬件,并为提高混合模型的效率提出了一些方向。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号