首页> 外文会议>2018 Design, Automation amp; Test in Europe Conference amp; Exhibition >LASER: A hardware/software approach to accelerate complicated loops on CGRAs
【24h】

LASER: A hardware/software approach to accelerate complicated loops on CGRAs

机译:激光:一种硬件/软件方法,可加速CGRA上的复杂循环

获取原文
获取原文并翻译 | 示例

摘要

Coarse-Grained Reconfigurable Arrays (CGRAs) are popular accelerators predominantly used in streaming, filtering, and decoding applications. Due to their high performance and high power-efficiency, CGRAs can be a promising solution to accelerate the loops of general purpose applications also. However, the loops in general purpose applications are often complicated, like loops with perfect and imperfect nests and loops with nested if-then-else's (conditionals). We argue that the existing hardware-software solutions to execute branches and conditions are inefficient. In order to efficiently execute complicated loops on CGRAs, we present a hardware-software hybrid solution: LASER - a comprehensive technique to accelerate compute-intensive loops of applications. In LASER, compiler transforms complex loops, maps them to the CGRA, and lays them out in the memory in a specific manner, such that the hardware can fetch and execute the instructions from the right path at runtime. LASER achieves a geomean performance improvement of 40.91% and utilization of 43.43% with 46% lower energy consumption.
机译:粗粒度可重配置阵列(CGRA)是流行的加速器,主要用于流传输,过滤和解码应用程序。由于其高性能和高功率效率,CGRA也是加速通用应用程序环路的有前途的解决方案。但是,通用应用程序中的循环通常很复杂,例如具有完美和不完美嵌套的循环以及具有嵌套if-then-else(条件)的循环。我们认为,现有的执行分支和条件的软硬件解决方案效率低下。为了有效地在CGRA上执行复杂的循环,我们提出了一种硬件-软件混合解决方案:LASER-一种加速应用程序的计算密集型循环的综合技术。在LASER中,编译器转换复杂的循环,将其映射到CGRA,并以特定的方式将其布置在内存中,以便硬件可以在运行时从正确的路径获取并执行指令。 LASER的几何平均性能提高了40.91%,利用率达到了43.43%,能耗降低了46%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号