Journal: ACM Transactions on Architecture and Code Optimization

DawnCC: Automatic Annotation for Data Parallelism and Offloading


Abstract

Directive-based programming models, such as OpenACC and OpenMP, allow developers to convert a sequential program into a parallel one with minimal human intervention. However, inserting pragmas into production code is a difficult and error-prone task that often requires familiarity with the target program. This difficulty restricts the ability of developers to annotate code that they have not written themselves. This article provides a suite of compiler-related methods to mitigate this problem. These techniques rely on symbolic range analysis, a well-known static technique, for two purposes: to populate source code with data transfer primitives, and to disambiguate pointers that could hinder automatic parallelization due to aliasing. We have implemented our ideas in a tool, DawnCC, which can be used stand-alone or through an online interface. To demonstrate its effectiveness, we show how DawnCC can annotate the programs available in PolyBench without any intervention from users. Such annotations lead to speedups of over 100x on an Nvidia architecture and over 50x on an ARM architecture.
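To make the two uses of range analysis concrete, the following is a minimal hand-written sketch in C with OpenACC pragmas. It is not DawnCC's actual output: the function names `ranges_disjoint` and `saxpy` are hypothetical, and the array bounds `[0:n]` stand in for what a symbolic range analysis might infer for this loop. The data clauses illustrate the data transfer primitives, and the runtime overlap test illustrates one way to guard against aliasing before offloading.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: returns 1 when the ranges [p, p+n) and [q, q+n)
 * do not overlap in memory, 0 otherwise. */
static int ranges_disjoint(const float *p, const float *q, size_t n) {
    uintptr_t pb = (uintptr_t)p, pe = (uintptr_t)(p + n);
    uintptr_t qb = (uintptr_t)q, qe = (uintptr_t)(q + n);
    return pe <= qb || qe <= pb;
}

/* Computes y[i] += a * x[i] for i in [0, n).
 * Assumption: the accessed ranges are x[0:n] and y[0:n], which is the
 * kind of bound a symbolic range analysis could derive from the loop. */
static void saxpy(int n, float a, const float *x, float *y) {
    assert(n >= 0);

    /* Runtime aliasing guard: offload only when x and y are disjoint. */
    int no_alias = ranges_disjoint(x, y, (size_t)n);

    /* Data transfer primitives: copy x to the device, copy y in and out.
     * When no_alias is 0 the region runs on the host instead. */
    #pragma acc data copyin(x[0:n]) copy(y[0:n]) if(no_alias)
    #pragma acc kernels if(no_alias)
    for (int i = 0; i < n; i++) {
        y[i] += a * x[i];
    }
}
```

A compiler without OpenACC support ignores the pragmas and runs the loop sequentially, so the function computes the same result with or without offloading.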
