Conflict-Free Loop Mapping for Coarse-Grained Reconfigurable Architecture with Multi-Bank Memory

Shouyi Yin; Xianqing Yao; Tianyi Lu; Dajiang Liu; Jiangyuan Gu; Leibo Liu; Shaojun Wei


Abstract

Coarse-grained reconfigurable architecture (CGRA) is a promising architecture that combines high performance, high power efficiency and attractive flexibility. The computation-intensive parts of an application (e.g., loops) are often mapped onto a CGRA for acceleration. Because loops demand highly parallel data access, CGRAs with multi-bank memory have been proposed to improve parallelism. For a CGRA with multi-bank memory, a joint solution that simultaneously considers memory partitioning and modulo scheduling is proposed to achieve a valid mapping with better performance. In this solution, modulo scheduling and operator scheduling are used to achieve a valid loop mapping and a valid data placement without any memory access conflicts. By avoiding the pipeline stalls caused by conflicts, the performance of loop mapping is greatly improved. Experimental results on benchmarks from Livermore, Polybench and Mediabench show that our approach improves the performance of loops on CGRA to 1.89×, 1.49× and 1.37× that of REGIMap, HTDM and REGIMap with memory partitioning, respectively, at the cost of an acceptable increase in compilation time.
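To make the notion of a memory access conflict concrete, here is a minimal sketch. It is not the authors' algorithm. It assumes a cyclic partitioning (bank = address mod number of banks), unit-stride accesses of the form x[i + offset], and a modulo schedule in which iteration i of an operation scheduled at time t runs at cycle t + i * II. The names `bank_of`, `find_conflicts`, `load_a` and `load_b` are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def bank_of(address, num_banks):
    """Assumed cyclic partitioning: consecutive addresses rotate across banks."""
    return address % num_banks


def find_conflicts(accesses, ii, num_banks, iterations=8):
    """Return sorted pairs of operations that hit the same bank in the same cycle.

    accesses maps an operation name to (schedule_time, offset). In loop
    iteration i the operation runs at cycle schedule_time + i * ii and
    touches address i + offset (hypothetical unit-stride access pattern).
    """
    users = {}  # (cycle, bank) -> set of operations using that bank in that cycle
    for op, (time, offset) in accesses.items():
        for i in range(iterations):
            key = (time + i * ii, bank_of(i + offset, num_banks))
            users.setdefault(key, set()).add(op)

    conflicts = set()
    for ops in users.values():
        for a, b in combinations(sorted(ops), 2):
            conflicts.add((a, b))
    return sorted(conflicts)


# Two loads, x[i] and x[i + 2], issued in the same cycle with II = 1 and 2 banks:
# both addresses have the same parity, so they land in the same bank.
same_cycle = {"load_a": (0, 0), "load_b": (0, 2)}
print(find_conflicts(same_cycle, ii=1, num_banks=2))  # [('load_a', 'load_b')]

# Delaying load_b by one cycle pairs it with load_a of the next iteration,
# whose address differs by one, so the two loads always use different banks.
delayed = {"load_a": (0, 0), "load_b": (1, 2)}
print(find_conflicts(delayed, ii=1, num_banks=2))  # []
```

The second call shows why scheduling and data placement interact: with the partitioning unchanged, moving one operation in the schedule removes the conflict, which is the kind of coupling a joint solution can exploit.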


Publication details

Source
IEEE Transactions on Parallel and Distributed Systems | 2017, Issue 9 | pp. 2471-2485 | 15 pages
Authors
Shouyi Yin; Xianqing Yao; Tianyi Lu; Dajiang Liu; Jiangyuan Gu; Leibo Liu; Shaojun Wei
Author affiliations

Institute of Microelectronics, Tsinghua University, Beijing, China (all seven authors)
Original format: PDF
Language: English
Keywords
Arrays; Pipeline processing; Registers; Routing; Memory management

