【24h】

Fast, Ring-Based Design of 3-D Stacked DRAM

机译:3-D堆叠DRAM的基于环的快速设计

获取原文
获取原文并翻译 | 示例

摘要

As computer memory increases in size and processors continue to get faster, the memory subsystem becomes a bottleneck to system performance. To mitigate the relatively slow dynamic random access memory (DRAM) chip speeds, a new generation of 3-D stacked DRAM is being developed, with lower power consumption and higher bandwidth. This paper proposes the use of 3-D ring-based data fabrics for fast data transfer between the chips in the 3-D stacked DRAM. The ring-based data fabric uses a fast standing wave oscillator to clock its transactions. With a fast clocking scheme and multiple channels sharing the same bus, more channels are utilized while significantly reducing the number of through-silicon vias. Our memory architecture using a ring-based scheme (MARS) can effectively trade off power, throughput, and latency to improve the system performance for different application spaces. Experimental results show that our ring-based data fabric can reduce read latencies and power consumption. MARS variants can deliver better latency (up to similar to 4x), power (up to similar to 8x), and performance per watt (up to similar to 8x) over high bandwidth memory. We also compare our approach withWide I/O, which is designed for power-constrained systems. MARS variants provide better latency (up to similar to 8x) with similar performance per watt.
机译:随着计算机内存大小的增加和处理器的不断提高,内存子系统成为系统性能的瓶颈。为了减轻相对较慢的动态随机存取存储器(DRAM)芯片速度,正在开发具有更低功耗和更高带宽的新一代3-D堆叠DRAM。本文提出了使用3-D环形数据结构在3-D堆叠DRAM中的芯片之间进行快速数据传输的建议。基于环的数据结构使用快速驻波振荡器来为其事务计时。利用快速时钟方案和多个通道共享同一条总线,可以利用更多的通道,同时显着减少硅通孔的数量。我们使用基于环的方案(MARS)的内存体系结构可以有效地权衡功率,吞吐量和延迟,以提高不同应用程序空间的系统性能。实验结果表明,我们基于环的数据结构可以减少读取延迟和功耗。 MARS变体可以在高带宽内存上提供更好的延迟(高达4倍),功率(高达8倍)和每瓦性能(高达8倍)。我们还将我们的方法与专为功耗受限的系统设计的宽I / O进行了比较。 MARS变体具有更好的延迟(高达8倍),每瓦性能相似。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号