Conference: Asia and South Pacific Design Automation Conference
Efficient mapping of CDFG onto coarse-grained reconfigurable array architectures

Abstract

In the approaching era of IoT, flexible and low-power accelerators have become essential to meet aggressive energy-efficiency targets. Over the last few decades, Coarse-Grained Reconfigurable Arrays (CGRAs) have demonstrated high energy efficiency as accelerators, especially for high-performance streaming applications. Existing CGRAs mostly rely on partial and full predication techniques to support conditional branches, but inefficient architecture and mapping support for control flow restricts CGRAs to accelerating either inner loop bodies only or transformed loops specifically adapted to the target CGRA. This paper proposes a novel CGRA architecture with support for jump and conditional jump instructions and a lightweight global synchronization mechanism to enable complete Control Data Flow Graph (CDFG) mapping in an ultra-low-power environment. The architecture is coupled with a complete design flow that efficiently maps applications with heavy control flow, starting from a generic C language description. The proposed mapping approach reduces the impact of the wasteful instruction issues inherent in conventional predication approaches, providing average energy improvements of 1.44× and 1.6× over state-of-the-art partial and full predication techniques, respectively. Moreover, when executing applications with heavy control flow, the proposed method achieves an average speed-up of up to 21× and an energy improvement of up to 50.42× with respect to sequential execution on a low-power embedded CPU, demonstrating its suitability for next-generation IoT applications.
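
The "wasteful instruction issues" that the abstract attributes to predication can be illustrated with a toy count of issue slots for a single if/else. This is only a sketch under simplifying assumptions, not the paper's architecture or mapping flow: the `Branch` type, the function names, and the one-slot `jump_cost` are all hypothetical, and the model ignores scheduling, parallelism across processing elements, and synchronization overhead.

```python
from dataclasses import dataclass


@dataclass
class Branch:
    then_ops: int  # operations on the path taken when the condition is true
    else_ops: int  # operations on the path taken when the condition is false


def issued_with_predication(b: Branch) -> int:
    # Assumption: under predication both paths are issued, and the
    # results of the untaken path are discarded or nullified.
    return b.then_ops + b.else_ops


def issued_with_jumps(b: Branch, cond: bool, jump_cost: int = 1) -> int:
    # Assumption: with a conditional jump only the taken path is issued,
    # plus a fixed, hypothetical cost for the jump itself.
    return (b.then_ops if cond else b.else_ops) + jump_cost


def wasted_issues(b: Branch, cond: bool) -> int:
    # Issue slots spent on the untaken path under predication.
    return b.else_ops if cond else b.then_ops


if __name__ == "__main__":
    b = Branch(then_ops=6, else_ops=2)
    for cond in (True, False):
        print(cond, issued_with_predication(b),
              issued_with_jumps(b, cond), wasted_issues(b, cond))
```

In this toy model the gap between the two schemes grows with the size of the untaken path, which is consistent with the abstract's focus on applications with heavy control flow. The actual energy and speed-up figures depend on the architecture and are reported in the paper itself.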