首页> 外文会议> >DStride: data-cache miss-address-based stride prefetching scheme for multimedia processors

【24h】

DStride: data-cache miss-address-based stride prefetching scheme for multimedia processors

机译：DStride：用于多媒体处理器的基于数据缓存的失误地址的跨步预取方案

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Prefetching reduces cache miss latency by moving data up in memory hierarchy before they are actually needed. Recent hardware-based stride prefetching techniques mostly rely on the processor pipeline information (e.g. program counter and branch prediction table) for prediction. Continuing developments in processor microarchitecture drastically change core pipeline design and require that existing hardware-based stride prefetching techniques be adapted to the evolving new processor architectures. In this paper we present a new hardware-based stride prefetching technique, called DStride, that is independent of processor pipeline design changes. In this new design, the first-level data cache miss address stream is used for the stride prediction. The miss addresses are separated into load stream and store stream to increase the efficiency of the predictor. They are checked separately against the recent miss address stream to detect the strides. The detected steady strides are maintained in a table that also performs look-ahead stride prefetching when the processor stride reference rate is higher than the prefetch request service rate. We evaluated our design with multimedia workloads using execution-driven simulation with SimpleScalar toolset. Our experiments show that DStride is very effective in reducing overall pipeline stalls due to cache miss latency, especially for stride-intensive applications such as multimedia workloads.

机译：预取通过在实际需要之前在内存层次结构中上移数据来减少高速缓存未命中延迟。最近的基于硬件的步幅预取技术主要依靠处理器管线信息（例如程序计数器和分支预测表）进行预测。处理器微体系结构的不断发展极大地改变了核心流水线设计，并要求现有的基于硬件的跨步预取技术适应不断发展的新处理器体系结构。在本文中，我们提出了一种新的基于硬件的步幅预取技术，称为DStride，它独立于处理器管线设计更改。在这种新设计中，第一级数据高速缓存未命中地址流用于步幅预测。未命中地址分为负载流和存储流，以提高预测器的效率。针对最近的未命中地址流分别检查它们，以检测跨步。在处理器步幅参考速率高于预取请求服务速率时，将检测到的稳定步幅保存在一个表中，该表还执行超前步幅预取。我们使用具有SimpleScalar工具集的执行驱动的仿真，通过多媒体工作负载评估了我们的设计。我们的实验表明，DStride在减少由于缓存未命中延迟而导致的总体流水线停滞方面非常有效，特别是对于跨步密集型应用（例如多媒体工作负载）而言。

著录项

来源
《》|2001年|P.62-70|共9页
会议地点
作者
Hariprakash; G.; Achutharaman; R.; Omondi; A.R.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. An integrated prefetching/caching scheme in multimedia servers [J] . Kim Eunsam, Liu Jonathan C. L. Journal of network and computer applications . 2017,第JUNa期

机译：多媒体服务器中的集成预取/缓存方案
2. Pattern-driven prefetching for multimedia applications on embedded processors [J] . Sbeyti H, Niar S, Eeckhout L Journal of systems architecture . 2006,第4期

机译：嵌入式处理器上多媒体应用程序的模式驱动预取
3. Evaluation of hardware-based stride and sequential prefetching in shared-memory multiprocessors [J] . Dahlgren F., Stenstrom P. IEEE Transactions on Parallel and Distributed Systems . 1996,第4期

机译：评估共享内存多处理器中基于硬件的步幅和顺序预取
4. DSTRIDE: data-cache miss-address-based stride prefetching scheme for multimedia processors [C] . Hariprakash G, Achutharaman R, Amos R. Omondi Australasian Computer Systems Architecture Conference . 2001

机译：DSTRITE：用于多媒体处理器的基于数据缓存未命中的STRIDE预取方案
5. Compiler-assisted hardware-based data prefetching for next generation processors. [D] . Guo, Yao. 2007

机译：面向下一代处理器的基于编译器的基于硬件的数据预取。
6. Data-Prefetching Scheme Based on Playback Delay and Positioning Satisfaction in Peer-To-Peer Video-On-Demand System [O] . Lei Wang, Xiaorui Li, Yaqiu Liu, 2018

机译：点对点视频点播系统中基于播放延迟和定位满意度的数据预取方案
7. DSTRIDE: Data-cache miss-address-based stride prefetching scheme for multimedia processors [O] . Hariprakash Achutharaman And, Hariprakash G, Achutharaman R, 2007

机译：DSTRIDE：用于多媒体处理器的基于数据缓存的失误地址的跨步预取方案

DStride: data-cache miss-address-based stride prefetching scheme for multimedia processors

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅