【24h】

Performance model for OpenMP parallelized loops

机译:OpenMP并行化环的性能模型

获取原文

摘要

OpenMP is one of the most widely used parallel programming techniques in modern multi-core era. Parallelizing a loop using OpenMP is just as simple as adding a few directive sentences. However, for its simplicity, it is not rare that programmers excessively use OpenMP to parallelize loops in various applications which introduce too much overhead and lead to performance degradation. This paper establishes a performance model for OpenMP parallelized loops to address the critical factors which influence the performance. The model is validated through experiments on three different multi-core platforms. The results shows that best performance can be obtained when number of threads used in OpenMP applications equals to the number of cores that available on the platform. And parallelizing the outmost loop in nested loops can get higher speedup.
机译:OpenMP是现代多核ERA中使用的最广泛使用的并行编程技术之一。 使用OpenMP并行化循环与添加一些指令句子一样简单。 然而,为了简单,程序员过度使用OpenMP并将循环平行于介绍过多的开销并导致性能下降,并不罕见。 本文建立了OpenMP并行化环路的性能模型,以解决影响性能的关键因素。 该模型通过三个不同的多核平台上的实验验证。 结果表明,当OpenMP应用程序中使用的线程等于平台上可用的核心数时,可以获得最佳性能。 并并行化嵌套循环中的最外面的循环可以获得更高的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号