首页> 外文期刊>Very Large Scale Integration (VLSI) Systems, IEEE Transactions on >Logic-Base Interconnect Design for Near Memory Computing in the Smart Memory Cube
【24h】

Logic-Base Interconnect Design for Near Memory Computing in the Smart Memory Cube

机译:智能内存多维数据集中的近内存计算的基于逻辑的互连设计

获取原文
获取原文并翻译 | 示例

摘要

Hybrid memory cube (HMC) has promised to improve bandwidth, power consumption, and density for the next-generation main memory systems. In addition, 3-D integration gives a second shot for revisiting near memory computation to fill the gap between processors and memories. In this paper, we study the required infrastructure inside the HMC to support near memory computation in a modular and flexible fashion. We propose a fully backward compatible extension to the standard HMC called the smart memory cube, and design a high bandwidth, low latency, and Advanced eXtensible Interface-4.0 compatible logic base (LoB) interconnect to serve the huge bandwidth demand by the HMCs serial links, and to provide extra bandwidth to a generic processor-in-memory (PIM) device embedded in the LoB. This interconnect features a novel address scrambling mechanism for the reduction in the vault/bank conflicts and robust operation even in the presence of pathological traffic patterns. Our cycle accurate simulation results demonstrate that this interconnect can easily meet the demands of the latest HMC specifications (up to 205 GB/s read bandwidth with 4 serial links and 32 memory vaults for injected random traffic). It further shown that the default addressing scheme of the HMC (low interleaving) is not reliable enough and operates poorly in the presence of specific traffic patterns from real applications. This is while the proposed scrambling mechanism operates robustly even in those cases. The interference between the PIM traffic and the main links is shown to be negligible when the number of PIM ports is limited to 2, requesting up to 64 GB/s without pushing the system into saturation. Finally, logic synthesis with Synopsys Design Compiler confirms that our interconnect is implementable and effective in terms of power, area, and timing (power consumption less than 5 mW up to 1 GHz and area less than 0.4 mm2).
机译:混合存储多维数据集(HMC)已承诺改善下一代主存储系统的带宽,功耗和密度。此外,3-D集成为重新访问近存储器计算填补了处理器和存储器之间的空白提供了第二个机会。在本文中,我们研究了HMC内部所需的基础结构,以模块化和灵活的方式支持近内存计算。我们提议对标准HMC进行完全向后兼容的扩展,称为智能内存多维数据集,并设计高带宽,低延迟和高级可扩展接口-4.0兼容逻辑库(LoB)互连,以满足HMC串行链路的巨大带宽需求,并为嵌入LoB的通用内存处理器(PIM)设备提供额外的带宽。这种互连具有新颖的地址加扰机制,即使在存在病理性流量模式的情况下,也可减少金库/银行冲突并实现稳定的操作。我们的周期精确仿真结果表明,这种互连可以轻松满足最新HMC规范的要求(通过4个串行链路和32个存储库用于注入随机流量,读取带宽高达205 GB / s)。它进一步表明,HMC的默认寻址方案(低交织)不够可靠,并且在存在来自实际应用程序的特定流量模式时运行不佳。尽管所提出的加扰机制即使在那些情况下也可以稳定运行。当PIM端口的数量限制为2个时,PIM流量与主链路之间的干扰被认为可以忽略不计,从而请求高达64 GB / s的速度而不会使系统陷入饱和。最后,通过Synopsys Design Compiler进行的逻辑综合证实了我们的互连在功率,面积和时序方面(在1 GHz以下时功耗小于5 mW,面积在0.4 mm2以下)是可实现且有效的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号