首页> 外文学位 >Code transformations to augment the scope of loop fusion in a production compiler.
【24h】

Code transformations to augment the scope of loop fusion in a production compiler.

机译:代码转换以扩大生产编译器中循环融合的范围。

获取原文
获取原文并翻译 | 示例

摘要

Loop fusion is a common optimization technique that takes two loops and combines them into a single large loop. Most of the existing loop fusion techniques focus on heuristics to optimize an objective function such as data reuse within the fused loop or parallelizability of the fused loop. Most programs have only a small number of loops that satisfy all of the conditions necessary for loop fusion to occur. This thesis identifies conditions that prevent loop fusion and focuses on code transformations applied to sets of loops to make these conditions satisfiable. It also presents algorithms to fuse loops in the IBM®XL compiler suite that generates code for the IBM family of PowerPC®processors. This compiler uses a heuristic in the loop distributor to ultimately determine which portions of a loop should remain in the same loop nest and which portions should be moved into a different loop nest. Thus, in order for the loop distributor to make the best possible decisions, maximal loop fusion is performed first to make the loop bodies as large as possible. The large loop bodies created by loop fusion can also benefit other loop optimizations which are run after loop fusion and before loop distribution. All of the algorithms have been implemented in the IBM XL compiler framework and tested on an IBM pSeries™ 630 machine equipped with a POWER4™ processor using the SPEC95 and SPEC2000 benchmark suites.
机译:循环融合是一种常见的优化技术,它采用两个循环并将它们组合为一个大循环。现有的大多数循环融合技术都集中在启发式算法上,以优化目标函数,例如融合环内的数据重用或融合环的可并行性。大多数程序只有少数几个循环,这些循环满足发生循环融合所需的所有条件。本文确定了防止循环融合的条件,并着重于应用于循环集的代码转换以使这些条件可满足。它还介绍了融合在IBM ® XL编译器套件中的循环的算法,该编译器套件为IBM PowerPC ®处理器系列生成代码。该编译器在循环分配器中使用试探法来最终确定循环的哪些部分应保留在同一循环嵌套中,以及哪些部分应移入不同的循环嵌套中。因此,为了使环路分配器做出最佳的决策,首先执行最大的环路融合,以使环路主体尽可能大。通过循环融合创建的大型循环体也可以使其他在循环融合之后和循环分发之前运行的循环优化受益。所有算法均已在IBM XL编译器框架中实现,并已在装有SPEC95和SPEC2000基准套件的配备POWER4™处理器的IBM pSeries™630计算机上进行了测试。

著录项

  • 作者

    Barton, Christopher Mark.;

  • 作者单位

    University of Alberta (Canada).;

  • 授予单位 University of Alberta (Canada).;
  • 学科 Computer Science.
  • 学位 M.Sc.
  • 年度 2003
  • 页码 102 p.
  • 总页数 102
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 自动化技术、计算机技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号