Logic-Base Interconnect Design for Near Memory Computing in the Smart Memory Cube

Erfan Azarkhish; Christoph Pfister; Davide Rossi; Igor Loi; Luca Benini


Abstract

Hybrid memory cube (HMC) has promised to improve bandwidth, power consumption, and density for next-generation main memory systems. In addition, 3-D integration offers a second chance to revisit near memory computation and fill the gap between processors and memories. In this paper, we study the infrastructure required inside the HMC to support near memory computation in a modular and flexible fashion. We propose a fully backward compatible extension to the standard HMC called the smart memory cube, and design a high bandwidth, low latency, Advanced eXtensible Interface 4.0 (AXI-4.0) compatible logic base (LoB) interconnect to serve the huge bandwidth demand of the HMC's serial links and to provide extra bandwidth to a generic processor-in-memory (PIM) device embedded in the LoB. This interconnect features a novel address scrambling mechanism that reduces vault/bank conflicts and keeps operation robust even in the presence of pathological traffic patterns. Our cycle-accurate simulation results demonstrate that this interconnect can easily meet the demands of the latest HMC specifications (up to 205 GB/s read bandwidth with 4 serial links and 32 memory vaults for injected random traffic). It is further shown that the default addressing scheme of the HMC (low interleaving) is not reliable enough and performs poorly in the presence of specific traffic patterns from real applications, whereas the proposed scrambling mechanism operates robustly even in those cases. The interference between the PIM traffic and the main links is shown to be negligible when the number of PIM ports is limited to 2, requesting up to 64 GB/s without pushing the system into saturation. Finally, logic synthesis with Synopsys Design Compiler confirms that our interconnect is implementable and effective in terms of power, area, and timing (power consumption less than 5 mW up to 1 GHz and area less than 0.4 mm²).
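The contrast the abstract draws between low-interleaved addressing and address scrambling can be illustrated with a small sketch. The abstract does not describe the paper's actual scrambling function, so the XOR-folding mapper below is only a generic stand-in, and the 32-byte block size and all function names are assumptions made for illustration. Only the vault count (32) comes from the abstract.

```python
from collections import Counter

NUM_VAULTS = 32    # from the abstract: 32 memory vaults
VAULT_BITS = 5     # log2(NUM_VAULTS)
BLOCK_BYTES = 32   # ASSUMPTION: access granularity, not stated in the abstract
OFFSET_BITS = 5    # log2(BLOCK_BYTES)


def vault_low_interleaved(addr: int) -> int:
    """Vault index taken from the address bits just above the block offset."""
    return (addr >> OFFSET_BITS) & (NUM_VAULTS - 1)


def vault_xor_scrambled(addr: int) -> int:
    """HYPOTHETICAL scrambler: XOR-fold every 5-bit group above the offset.

    This is a generic hashing technique, not the mechanism from the paper.
    """
    block = addr >> OFFSET_BITS
    vault = 0
    while block:
        vault ^= block & (NUM_VAULTS - 1)
        block >>= VAULT_BITS
    return vault


def vault_histogram(mapper, stride_bytes: int, accesses: int = 1024) -> Counter:
    """Count how many of `accesses` strided requests land in each vault."""
    return Counter(mapper(i * stride_bytes) for i in range(accesses))


# A pathological stride: every request skips exactly one block per vault.
stride = NUM_VAULTS * BLOCK_BYTES
low = vault_histogram(vault_low_interleaved, stride)
scr = vault_histogram(vault_xor_scrambled, stride)

print("low interleaving: vaults used =", len(low), "max load =", max(low.values()))
print("xor scrambling:   vaults used =", len(scr), "max load =", max(scr.values()))
```

Under these assumptions, the strided stream maps every request to a single vault with low interleaving, while the XOR-folded index spreads the same stream evenly over all 32 vaults. Real traffic and the paper's own mechanism may behave differently.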

Bibliographic details

Source
Very Large Scale Integration (VLSI) Systems, IEEE Transactions on | 2017, Issue 1 | pp. 210-223 | 14 pages
Authors
Erfan Azarkhish; Christoph Pfister; Davide Rossi; Igor Loi; Luca Benini
Author affiliations

Department of Electrical, Electronic and Information Engineering, University of Bologna, Bologna, Italy;

Signal and Information Processing Laboratory, ETH Zurich, Zürich, Switzerland;

Department of Electrical, Electronic and Information Engineering, University of Bologna, Bologna, Italy;

Department of Electrical, Electronic and Information Engineering, University of Bologna, Bologna, Italy;

Department of Electrical, Electronic and Information Engineering, University of Bologna, Bologna, Italy;

Original format: PDF
Language: English
Keywords
Random access memory; Bandwidth; Program processors; Memory management; Standards; Robustness;
