首页> 外文期刊>Very Large Scale Integration (VLSI) Systems, IEEE Transactions on >MAHA: An Energy-Efficient Malleable Hardware Accelerator for Data-Intensive Applications
【24h】

MAHA: An Energy-Efficient Malleable Hardware Accelerator for Data-Intensive Applications

机译:MAHA:用于数据密集型应用的节能型可延展硬件加速器

获取原文
获取原文并翻译 | 示例

摘要

For data-intensive applications, energy expended in on-chip computation constitutes only a small fraction of the total energy consumption. The primary contribution comes from transporting data between off-chip memory and on-chip computing elements—a limitation referred to as the Von-Neumann bottleneck. In such a scenario, improving the compute energy through parallel processing or on-chip hardware acceleration brings minor improvements to the total energy requirement of the system. We note that an effective solution to mitigate the Von-Neumann bottleneck is to develop a framework that enables computing in off-chip nonvolatile memory arrays, where the data reside permanently. In this paper, we present a malleable hardware (MAHA) reconfigurable framework that modifies nonvolatile CMOS-compatible flash memory array for on-demand reconfigurable computing. MAHA is a spatio-temporal mixed-granular hardware reconfigurable framework, which utilizes the memory for storage as well as lookup table-based computation (hence malleable) and uses a low-overhead hierarchical interconnect fabric for communication between processing elements. A detailed design of the malleable hardware together with a comprehensive application mapping flow is presented. Design overheads carefully estimated at the 45-nm technology node indicate that for a set of common kernels, MAHA achieves a 91X improvement in energy efficiency over a software-only solution with negligible impact on memory performance in normal mode. The proposed design changes incur only 6% memory area overhead.
机译:对于数据密集型应用,片上计算中消耗的能量仅占总能耗的一小部分。主要的贡献来自在片外存储器和片上计算元件之间的数据传输-这种限制称为Von-Neumann瓶颈。在这种情况下,通过并行处理或片上硬件加速来提高计算能量会使系统的总能量需求略有改善。我们注意到,缓解Von-Neumann瓶颈的有效解决方案是开发一种框架,该框架使能够在永久存在数据的片外非易失性存储器阵列中进行计算。在本文中,我们提出了可延展的硬件(MAHA)可重配置框架,该框架可修改非易失性CMOS兼容闪存阵列,以进行按需可重配置计算。 MAHA是时空混合粒度硬件可重配置框架,它利用内存进行存储以及基于查找表的计算(因此具有延展性),并使用低开销的分层互连结构在处理元素之间进行通信。提出了可延展硬件的详细设计以及全面的应用程序映射流程。在45纳米技术节点处仔细估算的设计开销表明,对于一组通用内核,MAHA的能效比仅软件解决方案提高了91倍,而对普通模式下的内存性能的影响可忽略不计。提议的设计更改仅产生6%的内存区域开销。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号