2010 IEEE International Symposium on Parallel & Distributed Processing (IPDPS)

Structuring the execution of OpenMP applications for multicore architectures

Abstract

Multi-core chips, now commonplace, have introduced by design a deep hierarchy of memory and cache banks within parallel computers, as a tradeoff between the user-friendliness of shared memory on the one hand and memory access scalability and efficiency on the other. However, getting high performance out of such machines requires a dynamic mapping of application tasks and data onto the underlying architecture. Moreover, depending on the application's behavior, this mapping should favor cache affinity, memory bandwidth, computation synchrony, or a combination of these. The great challenge is then to perform this hardware-dependent mapping in a portable, abstract way. To meet this need, we propose a new, hierarchical approach to the execution of OpenMP threads on multicore machines. Our ForestGOMP runtime system dynamically generates structured trees out of OpenMP programs. It also collects relationship information about threads and data. The scheduler uses this information, together with scheduling hints and hardware counter feedback, to select the most appropriate thread and data distribution. ForestGOMP features a high-level platform for developing and tuning portable thread schedulers. We present several applications for which we developed specific scheduling policies that achieve excellent speedups on 16-core machines.
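To make the idea of a tree of threads mapped onto a hierarchical machine more concrete, here is a minimal toy sketch in Python. It is not ForestGOMP's algorithm or API: the `Team` class, the `place` function, the thread names, and the two-node machine description are all hypothetical, chosen only to illustrate one simple policy that favors cache affinity by keeping the threads of each nested team on the same memory node.

```python
from dataclasses import dataclass, field


@dataclass
class Team:
    """Hypothetical model of the threads created by one parallel region."""
    name: str
    threads: list = field(default_factory=list)   # threads owned directly
    subteams: list = field(default_factory=list)  # teams from nested regions

    def all_threads(self):
        """Return this team's threads plus those of all nested teams."""
        out = list(self.threads)
        for sub in self.subteams:
            out.extend(sub.all_threads())
        return out


def place(root, nodes):
    """Assign each top-level subteam of `root` to one memory node.

    `nodes` maps a node name to its list of core ids (an assumed,
    hand-written topology). Subteams are spread round-robin over nodes,
    and the threads of a subteam are spread round-robin over that node's
    cores. Threads owned directly by `root` are not placed in this sketch.
    Returns a dict: thread name -> (node name, core id).
    """
    placement = {}
    node_names = list(nodes)
    for i, sub in enumerate(root.subteams):
        node = node_names[i % len(node_names)]
        cores = nodes[node]
        for j, thread in enumerate(sub.all_threads()):
            placement[thread] = (node, cores[j % len(cores)])
    return placement


# Two nested teams of four threads each, on an assumed 2-node, 8-core machine.
root = Team("outer", subteams=[
    Team("inner0", threads=[f"t0.{k}" for k in range(4)]),
    Team("inner1", threads=[f"t1.{k}" for k in range(4)]),
])
machine = {"node0": [0, 1, 2, 3], "node1": [4, 5, 6, 7]}
placement = place(root, machine)

for thread, (node, core) in sorted(placement.items()):
    print(thread, node, core)
```

A real runtime would, as the abstract describes, also take scheduling hints and hardware counter feedback into account and could rebalance at run time. This sketch only fixes a static placement from the tree shape.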
