首页> 外文期刊>Very Large Scale Integration (VLSI) Systems, IEEE Transactions on >Optimizing Spatial Mapping of Nested Loop for Coarse-Grained Reconfigurable Architectures
【24h】

Optimizing Spatial Mapping of Nested Loop for Coarse-Grained Reconfigurable Architectures

机译:为粗粒度可重构体系结构优化嵌套循环的空间映射

获取原文
获取原文并翻译 | 示例

摘要

Coarse-grained reconfigurable architectures (CGRAs) have drawn increasing attention due to their flexibility and efficiency. Loops in applications are often mapped onto CGRAs for acceleration, and the mapping of loops onto CGRA is quite a challenging work due to the parallel execution paradigm and constrained hardware resource. To map loops onto CGRAs efficiently, it is important to transform loops into pieces that obey hardware resource constraints with less overhead (e.g., communication and configuration overhead). In this paper, we tackle this problem by establishing a performance optimization problem, including loop transformation and back- end placing and routing. A novel searching strategy is also designed to find the optimal result efficiently. Finally, we built a complete flow of mapping loop nests onto CGRA. Experiment results on most kernels of the Polybench show that our proposed approach can improve the performance of the kernels by 42% on average, as compared with the state-of-the-art methods. The runtime complexity of our approach is also acceptable.
机译:粗粒度可重构体系结构(CGRA)由于其灵活性和效率而引起了越来越多的关注。应用程序中的循环通常被映射到CGRA上以进行加速,由于并行执行范例和受限制的硬件资源,将循环映射到CGRA上是一项艰巨的工作。为了将循环有效地映射到CGRA,将循环转换成符合硬件资源约束且开销较小(例如,通信和配置开销)的片段很重要。在本文中,我们通过建立性能优化问题来解决此问题,包括循环转换以及后端布局和路由。还设计了一种新颖的搜索策略来有效地找到最佳结果。最后,我们在CGRA上构建了完整的映射循环嵌套流程。在Polybench的大多数内核上的实验结果表明,与最新方法相比,我们提出的方法可以将内核的性能平均提高42%。我们的方法的运行时复杂性也是可以接受的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号