首页> 外文期刊>Very Large Scale Integration (VLSI) Systems, IEEE Transactions on >Improving Nested Loop Pipelining on Coarse-Grained Reconfigurable Architectures
【24h】

Improving Nested Loop Pipelining on Coarse-Grained Reconfigurable Architectures

机译:改进粗粒度可重构体系结构上的嵌套循环流水线

获取原文
获取原文并翻译 | 示例

摘要

Coarse-grained reconfigurable architecture (CGRA) is a promising architecture with high performance, high power efficiency, and attraction of flexibility. The computation-intensive portions of applications, i.e., loops, are often implemented on CGRAs for acceleration. The loop pipelining techniques are usually used to exploit the parallelism of loops. However, for nested loops, the existing loop pipelining methods often result in poor hardware utilization and low execution performance. To tackle this problem, this paper makes three contributions: 1) we propose the use of affine transformation to facilitate nested loop pipelining; 2) based on polyhedral model, we present a precise and general formulation of the nested loop pipelining problem on a CGRA; and 3) using the insights from problem formulation, we design a joint affine transformation and multipipeline merging approach to improve the performance of nested loop on CGRA. The experimental results show that our approach can improve the performance of nested loops up to 35% on average, compared with the state-of-the-art techniques.
机译:粗粒度可重构体系结构(CGRA)是一种有前途的体系结构,具有高性能,高能效和灵活性。应用程序的计算密集型部分(即循环)通常在CGRA上实现以加速。循环流水线技术通常用于开发循环的并行性。但是,对于嵌套循环,现有的循环流水线方法通常会导致较差的硬件利用率和较低的执行性能。为了解决这个问题,本文做出了三点贡献:1)我们建议使用仿射变换来简化嵌套循环流水线; 2)基于多面体模型,我们在CGRA上给出了嵌套循环流水线问题的精确和通用公式; 3)利用问题表述的见解,设计了仿射变换和多管道合并的联合方法,以提高CGRA上嵌套循环的性能。实验结果表明,与最新技术相比,我们的方法可以将嵌套循环的性能平均提高35%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号