
What Multilevel Parallel Programs do when you are not Watching: A Performance Analysis Case Study Comparing MPI/OpenMP, MLP, and Nested OpenMP


Abstract

With the current trend in parallel computer architectures towards clusters of shared memory symmetric multi-processors, parallel programming techniques have evolved that support parallelism beyond a single level. When comparing the performance of applications based on different programming paradigms, it is important to differentiate between the influence of the programming model itself and other factors, such as implementation-specific behavior of the operating system (OS) or architectural issues. Rewriting a large scientific application in order to employ a new programming paradigm is usually a time-consuming and error-prone task. Before embarking on such an endeavor, it is important to determine that there is really a gain that would not be possible with the current implementation. A detailed performance analysis is crucial to clarify these issues. The multilevel programming paradigms considered in this study are hybrid MPI/OpenMP, MLP, and nested OpenMP. The hybrid MPI/OpenMP approach is based on using MPI [7] for the coarse-grained parallelization and OpenMP [9] for fine-grained loop-level parallelism. The MPI programming paradigm assumes a private address space for each process. Data is transferred by explicitly exchanging messages via calls to the MPI library. This model was originally designed for distributed memory architectures but is also suitable for shared memory systems. The second paradigm under consideration is MLP, which was developed by Taft. The approach is similar to MPI/OpenMP, using a mix of coarse-grained process-level parallelization and loop-level OpenMP parallelization. As is the case with MPI, a private address space is assumed for each process. The MLP approach was developed for ccNUMA architectures and explicitly takes advantage of the availability of shared memory. A shared memory arena which is accessible by all processes is required. Communication is done by reading from and writing to the shared memory.
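To make the hybrid model concrete, the following minimal sketch illustrates the MPI/OpenMP pattern described above: MPI provides the coarse-grained decomposition across private address spaces, while OpenMP parallelizes a loop within each process. The local array size, the neighbor exchange, and the MPI_THREAD_FUNNELED thread level are illustrative assumptions and are not taken from the paper or from the application studied there.

/* Hybrid MPI/OpenMP sketch (assumed example, not from the paper):
 * MPI handles coarse-grained parallelism across processes,
 * OpenMP handles fine-grained loop-level parallelism within each process. */
#include <mpi.h>
#include <omp.h>
#include <stdio.h>
#include <stdlib.h>

#define N 1000000  /* local problem size per MPI process (assumed) */

int main(int argc, char **argv)
{
    int provided, rank, size;

    /* Request a thread level so OpenMP threads may coexist with MPI calls. */
    MPI_Init_thread(&argc, &argv, MPI_THREAD_FUNNELED, &provided);
    MPI_Comm_rank(MPI_COMM_WORLD, &rank);
    MPI_Comm_size(MPI_COMM_WORLD, &size);

    double *u = malloc(N * sizeof(double));

    /* Fine-grained, loop-level parallelism inside each MPI process. */
    #pragma omp parallel for
    for (int i = 0; i < N; i++)
        u[i] = rank + 0.001 * i;

    /* Coarse-grained communication between private address spaces:
     * each process sends one boundary value to its right neighbor
     * and receives one from its left neighbor. */
    double send = u[N - 1], recv = 0.0;
    int right = (rank + 1) % size, left = (rank + size - 1) % size;
    MPI_Sendrecv(&send, 1, MPI_DOUBLE, right, 0,
                 &recv, 1, MPI_DOUBLE, left, 0,
                 MPI_COMM_WORLD, MPI_STATUS_IGNORE);

    printf("rank %d received %f using up to %d OpenMP threads\n",
           rank, recv, omp_get_max_threads());

    free(u);
    MPI_Finalize();
    return 0;
}

In contrast, an MLP-style program would replace the explicit message exchange with reads and writes to a shared memory arena visible to all processes, as noted in the abstract.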
