IEEE Pacific Rim Conference on Communications, Computers and Signal Processing

Cache prefetching and speculation on multi-threaded processors



Abstract

Data prefetching is an important mechanism for hiding memory latency in single-threaded, desktop workloads. For multi-threaded, commercial workloads, prefetching offers much more modest improvements in performance at a high cost in cache power and bandwidth to the higher level caches. This paper shows that by combining speculation with a selective prefetching scheme, we can reduce the cache access power overhead while improving performance. We demonstrate that “likely-to-miss” load instructions can be accurately identified and we propose two hardware-based techniques for improving load latencies in multi-threaded commercial workloads. First, we modify a next-four-lines prefetching scheme to only perform the prefetch for likely-to-miss loads. Second, we forward addresses for likely-to-miss loads to the L2 and L3 caches for tag look-up immediately after address translation. Combined, these two techniques reduce the extra cache access power of the L3 cache by up to 53% while slightly improving performance when compared with a simple next-four-lines prefetcher running standard, commercial-workload benchmarks.
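The abstract does not say how "likely-to-miss" loads are identified in hardware, so the following is only a minimal sketch of the selective-prefetch idea, assuming a PC-indexed table of saturating miss counters gating a next-four-lines prefetch; the table size, line size, and threshold are illustrative and not taken from the paper.

/*
 * Sketch (not the paper's exact hardware): a next-four-lines prefetcher
 * that only fires for loads predicted likely to miss.
 * Assumed predictor: PC-indexed 2-bit saturating counters.
 */
#include <stdint.h>
#include <stdio.h>
#include <stdbool.h>

#define LINE_SIZE      64u      /* bytes per cache line (assumed)      */
#define PRED_ENTRIES   1024u    /* predictor table entries (assumed)   */
#define PREFETCH_DEPTH 4u       /* "next four lines"                   */

static uint8_t miss_ctr[PRED_ENTRIES];   /* 2-bit saturating counters */

static unsigned pred_index(uint64_t pc) {
    return (unsigned)((pc >> 2) % PRED_ENTRIES);
}

/* Counter value >= 2 means "likely to miss". */
static bool likely_to_miss(uint64_t pc) {
    return miss_ctr[pred_index(pc)] >= 2;
}

/* Train the predictor with the actual L1 hit/miss outcome. */
static void train(uint64_t pc, bool missed) {
    uint8_t *c = &miss_ctr[pred_index(pc)];
    if (missed) { if (*c < 3) (*c)++; }
    else        { if (*c > 0) (*c)--; }
}

/* On each load: prefetch the next four lines only if predicted to miss. */
static void on_load(uint64_t pc, uint64_t vaddr, bool l1_missed) {
    if (likely_to_miss(pc)) {
        uint64_t line = vaddr / LINE_SIZE;
        for (unsigned i = 1; i <= PREFETCH_DEPTH; i++)
            printf("prefetch line 0x%llx (pc=0x%llx)\n",
                   (unsigned long long)((line + i) * LINE_SIZE),
                   (unsigned long long)pc);
    }
    train(pc, l1_missed);
}

int main(void) {
    /* Toy trace: one load PC that keeps missing in the L1. After two
     * observed misses the predictor starts issuing prefetches for it. */
    for (int i = 0; i < 4; i++)
        on_load(0x400100, 0x10000 + (uint64_t)i * 4096, /*l1_missed=*/true);
    return 0;
}

The paper's second technique, forwarding the translated address of a likely-to-miss load to the L2 and L3 tags right after address translation, would be gated by the same predicate; it is not modeled in this sketch.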
