Conference: Symposium on Integrated Circuits and Systems Design

Unlocking the Full Potential of Heterogeneous Accelerators by Using a Hybrid Multi-Target Binary Translator


Abstract

Embedded systems integrate multiple accelerators to exploit both instruction-level and data-level parallelism, maximizing performance per watt. However, using accelerators usually requires changes to the source code, which breaks binary compatibility and increases time-to-market. Binary Translation (BT) mechanisms have therefore emerged as an alternative, since they dynamically detect and transform parts of the application for optimization without requiring any prior modification of the code. Nevertheless, the available BT approaches are limited to a single accelerator, which may not always yield the optimal energy-performance trade-off, because different parts of an application may benefit most from one accelerator or another depending on their intrinsic parallelism. Given that, this work proposes a Hybrid Multi-target Binary Translator (HMTBT). Our HMTBT transparently translates code to two different accelerators, a CGRA (Coarse-Grained Reconfigurable Architecture) and a NEON engine, and automatically dispatches each translation to the better-suited one according to the type of parallelism (ILP or DLP) available at that moment. HMTBT improves performance by 54% and 76% and saves energy by 15% and 25% compared with a BT targeting only a CGRA and a BT targeting only a NEON engine, respectively. We also compare HMTBT with a system that features both CGRA and NEON BT mechanisms, showing, on average, 12% energy savings and a 14% performance improvement.
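
The dispatch idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `Region` fields, the `choose_target` rule, and the criterion used to label a region as DLP are all hypothetical, chosen only to show how a translator might route each hot region to a NEON engine or a CGRA depending on the kind of parallelism it exposes.

```python
from dataclasses import dataclass
from enum import Enum


class Target(Enum):
    CGRA = "cgra"  # coarse-grained reconfigurable array, used here for ILP
    NEON = "neon"  # SIMD engine, used here for DLP


@dataclass
class Region:
    """Hypothetical summary of a hot code region seen by the translator."""
    name: str
    is_loop: bool
    iterations_independent: bool  # assumed: no loop-carried dependences found


def choose_target(region: Region) -> Target:
    """Pick an accelerator from the kind of parallelism the region exposes."""
    # Assumption: a loop with independent iterations is treated as DLP.
    if region.is_loop and region.iterations_independent:
        return Target.NEON
    # Assumption: everything else is treated as ILP and sent to the CGRA.
    return Target.CGRA


def dispatch(regions):
    """Map each region name to the accelerator chosen for it."""
    return {r.name: choose_target(r) for r in regions}
```

For example, under these assumptions `dispatch` would send a loop with independent iterations to `Target.NEON` and a straight-line block or a loop with carried dependences to `Target.CGRA`. A real translator would base this decision on information gathered at run time rather than on two boolean flags.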
