首页> 外文期刊>Computer architecture news >Data Trace Cache: An Application Specific Cache Architecture
【24h】

Data Trace Cache: An Application Specific Cache Architecture

机译:数据跟踪缓存:一种专用缓存结构

获取原文
获取原文并翻译 | 示例

摘要

Benefits of advances in processor technology have long been held hostage to the widening processor-memory gap.Off-chip memory access latency is one of the most critical parameters limiting system performance. Caches have been used as a way of alleviating this problem by reducing the average memory access latency. The memory bottleneck assumes greater significance for high performance computer architectures with high data throughput requirements such as network processors. This paper addresses the memory bottleneck with the goal of minimizing off-chip memory demand and average memory access latency by proposing the use of small application specific compiler-visible data trace caches. We focus on tree data structures which are responsible for a significant component of the memory traffic in several applications. We have observed that tree accesses create a simple to characterize trace of memory references and propose a data trace cache design to exploit the locality of reference in these data traces. Our study reveals that data trace caches can reduce the total number of misses from 7% to 53% for accesses to rooted tree data structures as compared to a conventional cache for a variety of applications for small cache sizes (256 - 1024 bytes). Such caches are in keeping with the philosophy of victim caches, stream buffers, and pre-fetch buffers in that relatively small investments in silicon can realize substantive reduction in off-chip memory bandwidth demand.
机译:长期以来,处理器技术进步的好处一直是处理器与内存之间差距不断扩大的人质。片外存储器访问延迟是限制系统性能的最关键参数之一。缓存已被用作通过减少平均内存访问延迟来缓解此问题的方法。对于具有高数据吞吐量要求的高性能计算机体系结构(例如网络处理器),内存瓶颈具有更大的意义。本文提出了使用小型专用于编译器可见的数据跟踪高速缓存的方法,以最大程度地减少片外存储器需求和平均存储器访问延迟为目标,解决了存储器瓶颈。我们专注于树型数据结构,这些结构在多个应用程序中占内存流量的重要部分。我们已经观察到,树访问创建了一种简单的方法来表征内存引用的跟踪,并提出了数据跟踪缓存设计以利用这些数据跟踪中的引用局部性。我们的研究表明,与用于小型缓存大小(256-1024字节)的各种应用程序的常规缓存相比,数据跟踪缓存可以将访问根树数据结构的未命中总数从7%降低到53%。此类高速缓存符合受害高速缓存,流缓冲区和预取缓冲区的原理,因为相对较小的硅投资可以实现对片外存储器带宽需求的大幅减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号