Conference: International Conference on High Performance Computing Simulation

In search of the best MPI-OpenMP distribution for optimum Intel-MIC cluster performance



Abstract

Applications for HPC platforms are mainly based on hybrid programming models: MPI for communication between nodes, and OpenMP for task and fork-join parallelism that exploits shared-memory communication inside a node. Much research has built on this scheme to improve performance, for example by overlapping communication with computation, or by increasing speedup and bandwidth on new network fabrics (e.g. InfiniBand and 10 or 40 Gigabit Ethernet). In terms of both computation and communication, future HPC platforms will be heterogeneous systems with high-speed networks. In this context, an important issue is deciding how to distribute the workload among all the nodes so that application execution is balanced, as well as choosing the most appropriate programming model to exploit parallelism inside each node. In this paper we propose a mechanism that dynamically balances the work distribution among the components of a heterogeneous cluster based on their performance characteristics. For our evaluation we run the miniFE mini-application from the Mantevo benchmark suite on a heterogeneous Intel MIC cluster. Experimental results show that choosing an appropriate number of threads can improve performance significantly compared with using the maximum number of cores available on the Intel MIC.
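The abstract does not spell out how the balancing mechanism works. As a rough illustration of the general idea only (work shares proportional to measured performance), the following Python sketch splits a fixed number of work items among workers and re-splits them after each timed iteration. The function names `split_rows` and `rebalance`, the use of rows per second as the performance metric, and the largest-remainder rounding are all assumptions made for this example, not the authors' method.

```python
def split_rows(total_rows, throughputs):
    """Split total_rows among workers in proportion to their throughput.

    throughputs: one positive number per worker (e.g. rows per second).
    Hypothetical helper for illustration; not taken from the paper.
    """
    if total_rows < 0 or not throughputs or any(t <= 0 for t in throughputs):
        raise ValueError("need total_rows >= 0 and positive throughputs")

    total = sum(throughputs)
    ideal = [total_rows * t / total for t in throughputs]

    # Start from the floor of each ideal share ...
    counts = [int(x) for x in ideal]

    # ... then hand the leftover rows to the workers with the largest
    # fractional remainders, so the counts sum exactly to total_rows.
    leftover = total_rows - sum(counts)
    by_remainder = sorted(
        range(len(ideal)), key=lambda i: ideal[i] - counts[i], reverse=True
    )
    for i in by_remainder[:leftover]:
        counts[i] += 1
    return counts


def rebalance(total_rows, counts, elapsed):
    """Re-split the work using timings from the previous iteration.

    counts:  rows each worker processed last iteration.
    elapsed: seconds each worker needed for those rows (all > 0).
    A worker that had no rows is treated as having processed one,
    so it is never permanently starved (an assumption of this sketch).
    """
    throughputs = [max(c, 1) / t for c, t in zip(counts, elapsed)]
    return split_rows(total_rows, throughputs)
```

For instance, if two workers each processed 50 rows but the second took three times as long, `rebalance(100, [50, 50], [1.0, 3.0])` shifts the split to 75 and 25 rows. In a real hybrid code the timings would come from each MPI rank, and the same measure-and-compare loop could be applied to candidate OpenMP thread counts.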
