【24h】

Loop Striping: Maximize Parallelism for Nested Loops

机译:循环条带化:最大化嵌套循环的并行度

获取原文
获取原文并翻译 | 示例

摘要

The majority of scientific and Digital Signal Processing (DSP) applications are recursive or iterative. Transformation techniques are generally applied to increase parallelism for these nested loops. Most of the existing loop transformation techniques either can not achieve maximum parallelism, or can achieve maximum parallelism but with complicated loop bounds and loop indexes calculations. This paper proposes a new technique, loop striping, that can maximize parallelism while maintaining the original row-wise execution sequence with minimum overhead. Loop striping groups iterations into stripes, where a stripe is a group of iterations in which all iterations are independent and can be executed in parallel. Theorems and efficient algorithms are proposed for loop striping transformations. The experimental results show that loop striping always achieves better iteration period than software pipelining and loop unfolding, improving average iteration period by 50% and 54% respectively.
机译:大多数科学和数字信号处理(DSP)应用都是递归的或迭代的。通常应用转换技术来增加这些嵌套循环的并行度。大多数现有的循环转换技术要么无法实现最大并行度,要么可以实现最大并行度,但是具有复杂的循环边界和循环索引计算。本文提出了一种新技术,即循环条带化,该技术可以在保持并行性的同时,以最小的开销最大化并行性。循环条带化将迭代分组为多个条带,其中,条带是一组迭代,其中所有迭代都是独立的,并且可以并行执行。提出了用于循环条带化变换的定理和有效算法。实验结果表明,循环条带化总是比软件流水线和循环展开获得更好的迭代周期,平均迭代周期分别提高了50%和54%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号