首页> 外文OA文献 >Automatic code generation for executing tiled nested loops onto parallel architectures
【2h】

Automatic code generation for executing tiled nested loops onto parallel architectures

机译:自动代码生成,用于在并行体系结构上执行平铺嵌套循环

摘要

This paper presents a novel approach for the problem of generating tiled code for nested for-loops using a tiling transformation. Tiling or supernode transformation has been widely used to improve locality in multi-level memory hierarchies as well as to efficiently execute loops onto non-uniform memory access architectures. However, automatic code generation for tiled loops can be a very complex compiler work due to non-rectangular tile shapes and iteration space bounds. Our method considerably enhances previous work on rewriting tiled loops by considering parallelepiped tiles and arbitrary iteration space shapes. The complexity of code generation for tiling transformation is now reduced to the complexity of code generation for any linear transformation. Experimental results which compare all so far presented approaches, show that the proposed approach for generating tiled code is significantly accelerated.
机译:本文提出了一种使用平铺变换为嵌套for循环生成平铺代码的问题的新颖方法。切片或超节点转换已被广泛用于改善多级内存分层结构中的局部性,以及有效地将循环执行到非均匀内存访问体系结构上。但是,由于非矩形图块的形状和迭代空间的限制,自动生成图块循环的代码可能是非常复杂的编译器工作。通过考虑平行六面体图块和任意迭代空间形状,我们的方法大大增强了以前重写图块循环的工作。现在,将用于切片转换的代码生成的复杂性降低为用于任何线性转换的代码生成的复杂性。比较所有目前提出的方法的实验结果表明,提出的生成切片代码的方法得到了显着加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号