Architecting On-Chip DRAM Cache for Simultaneous Miss Rate and Latency Reduction

F. Hameed; L. Bauer; J. Henkel

首页> 外文期刊>IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems >Architecting On-Chip DRAM Cache for Simultaneous Miss Rate and Latency Reduction

【24h】

Architecting On-Chip DRAM Cache for Simultaneous Miss Rate and Latency Reduction

机译：设计片上DRAM高速缓存以同时降低丢失率和延迟

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

On-chip dynamic random access memory (DRAM) cache has been recently employed in the memory hierarchy to mitigate the widening latency gap between high-speed cores and off-chip memory. Two important parameters are the DRAM cache miss rate (D$-MR) and the DRAM cache hit latency (D$-HL), as they strongly influence the performance. These parameters depend upon the DRAM set mapping policy. Recently proposed DRAM set mapping policies are predominantly optimized for either D$-MR or D$-HL. We propose novel DRAM set mapping policies that simultaneously reduce D$-MR (via high associativity) and D$-HL (via improved row buffer hit rates). To further improve the D$-HL, we propose a small and low latency DRAM Tag cache (DTC) structure that can quickly determine whether an access to the DRAM cache will be a hit or a miss. The performance of the proposed DTC depends upon the DTC hit rate. To increase it, we present a novel DTC insertion policy that also increases the DTC hit rate. We investigate the latency and miss rate tradeoffs when designing a DRAM cache hierarchy and analyze the effects of different policies on the overall performance. We evaluate our policies on a wide variety of workloads and compare its performance with three recent proposals for on-chip DRAM caches. For a 16-core system, our set mapping policy along with our DTC and its adaptive DTC insertion policy improve the harmonic mean instruction per cycle throughput by 25.4%, 15.5%, and 7.3% compared to state-of-the-art, while requiring 55% less storage overhead for DRAM cache hit/miss prediction.

机译：片上动态随机存取存储器（DRAM）缓存最近已在存储器层次结构中使用，以减轻高速内核与片外存储器之间不断扩大的等待时间差距。两个重要参数是DRAM高速缓存未命中率（D $ -MR）和DRAM高速缓存命中等待时间（D $ -HL），因为它们会严重影响性能。这些参数取决于DRAM集映射策略。最近提出的DRAM集映射策略主要针对D $ -MR或D $ -HL进行了优化。我们提出了新颖的DRAM集映射策略，该策略同时降低D $ -MR（通过高关联性）和D $ -HL（通过提高行缓冲区命中率）。为了进一步改善D $ -HL，我们提出了一种小型且低延迟的DRAM标签缓存（DTC）结构，该结构可以快速确定对DRAM缓存的访问是命中还是未命中。提议的故障诊断代码的性能取决于故障诊断代码命中率。为了增加它，我们提出了一种新颖的DTC插入策略，该策略还可以提高DTC命中率。我们在设计DRAM缓存层次结构时研究了延迟和未命中率的权衡，并分析了不同策略对整体性能的影响。我们评估了针对各种工作负载的策略，并将其性能与最近针对片上DRAM缓存的三个建议进行了比较。对于16核系统，与最先进的技术相比，我们的设置映射策略以及DTC及其自适应DTC插入策略将每个周期的谐波平均指令吞吐量提高了25.4％，15.5％和7.3％。 DRAM缓存命中/丢失预测所需的存储开销减少了55％。

著录项

来源
《IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems》 |2016年第4期|651-664|共14页
作者
F. Hameed; L. Bauer; J. Henkel;
展开▼
作者单位

Fazal Hameed is with the KIT, Germany.(Email: hameed@ira.uka.de);

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Architecture; cache; embedded systems; memory; memory hierarchy;

机译：体系结构;缓存;嵌入式系统;内存;内存层次结构;

相似文献

外文文献
中文文献
专利

1. The cache DRAM architecture: a DRAM with an on-chip cache memory [J] . Hidaka H., Matsuda Y. IEEE Micro . 1990,第2期

机译：缓存DRAM架构：具有片上缓存的DRAM
2. Die-Stacked DRAM Caches for Servers Hit Ratio, Latency, or Bandwidth? Have It All with Footprint Cache [J] . Djordje Jevdjic, Stavros Volos, Babak Falsafi Computer architecture news . 2013,第3期

机译：芯片堆叠式DRAM缓存是针对服务器的命中率，延迟还是带宽？拥有足迹缓存
3. A Survey Of Architectural Approaches for Managing Embedded DRAM and Non-Volatile On-Chip Caches [J] . Mittal Sparsh, Vetter Jeffrey S., Li Dong Parallel and Distributed Systems, IEEE Transactions on . 2015,第6期

机译：管理嵌入式DRAM和非易失性片上高速缓存的体系结构方法的概述
4. Fundamental Latency Trade-off in Architecting DRAM Caches: Outperforming Impractical SRAM-Tags with a Simple and Practical Design [C] . Qureshi Moinuddin K., Loh Gabe H. IEEE/ACM International Symposium on Microarchitecture . 2012

机译：架构DRAM缓存时的基本延迟权衡：通过简单实用的设计胜过不切实际的SRAM标签
5. Circuit and microarchitectural techniques for processor on-chip cache leakage power reduction. [D] . Kim, Nam Sung. 2004

机译：用于减少处理器片上高速缓存泄漏功率的电路和微体系结构技术。
6. In-DRAM Cache Management for Low Latency and Low Power 3D-Stacked DRAMs [O] . Ho Hyun Shin, Eui-Young Chung 2019

机译：用于低延迟和低功耗3D堆叠DRAM的DRAM中缓存管理
7. A Survey Of Architectural Approaches for Managing Embedded DRAM and Non-Volatile On-Chip Caches [O] . Sparsh Mittal, Jeffrey S. Vetter, Dong Li 2015

机译：嵌入式DRam和非易失性片上缓存管理的架构方法综述

Architecting On-Chip DRAM Cache for Simultaneous Miss Rate and Latency Reduction

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅