首页> 外文会议>International Workshop on OpenMP >Towards Unifying OpenMP Under the Task-Parallel Paradigm Implementation and Performance of the taskloop Construct
【24h】

Towards Unifying OpenMP Under the Task-Parallel Paradigm Implementation and Performance of the taskloop Construct

机译:在任务并行范例实现和任务循环构造性能下实现OpenMP统一

获取原文

摘要

OpenMP 4.5 introduced a task-parallel version of the classical thread-parallel for-loop construct: the taskloop construct. With this new construct, programmers are given the opportunity to choose between the two parallel paradigms to parallelize their for loops. However, it is unclear where and when the two approaches should be used when writing efficient parallel applications. In this paper, we explore the taskloop construct. We study performance differences between traditional thread-parallel for loops and the new taskloop directive. We introduce an efficient implementation and compare our implementation to other taskloop implementations using micro- and kernel-benchmarks, as well as an application. We show that our taskloop implementation on average results in a 3.2 % increase in peak performance when compared against corresponding parallel-for loops.
机译:OpenMP 4.5引入了经典线程循环构造的任务 - 并行版本:TaskLoop构造。通过这个新的构造,程序员有机会在两个并行范式之间选择,以并行化他们的循环。但是,目前尚不清楚在写入有效的并行应用时应使用两种方法。在本文中,我们探索了TaskLoop构造。我们研究传统的线程与循环和新TaskLoop指令之间的性能差异。我们介绍了一个有效的实现,并将我们的实现与其他任务的实现,使用微型和内核基准以及应用程序以及应用程序。我们表明,与相应的并行循环相比,我们的TaskLoop实现平均结果达到3.2%的峰值性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号