IEEE Transactions on Computers

A performance study of instruction cache prefetching methods


Abstract

Prefetching methods for instruction caches are studied via trace-driven simulation. The two primary methods are "fall-through" prefetch (sometimes referred to as "one block lookahead") and "target" prefetch. Fall-through prefetches are for sequential line accesses, and a key parameter is the distance from the end of the current line where the prefetch for the next line is initiated. Target prefetches work also for nonsequential line accesses. A prediction table is used and a key aspect is the prediction algorithm implemented by the table. Fall-through prefetch and target prefetch each improve performance significantly. When combined in a hybrid algorithm, their performance improvement is nearly additive. An instruction cache using a combined target and fall-through method can provide the same performance as a two to four times larger cache that does not prefetch. A good prediction method must not only be accurate, but prefetches must be initiated early enough to allow time for the instructions to return from main memory. To quantify this, we define a "prefetch efficiency" measure that reflects the amount of memory fetch delay that may be successfully hidden by prefetching. The better prefetch methods (in terms of miss rate) also have very high efficiencies, hiding approximately 90 percent of the miss delay for prefetched lines. Another performance measure of interest is memory traffic. Without prefetching, large line sizes give better hit rates; with prefetching, small line sizes tend to give better overall hit rates. Because smaller line sizes tend to reduce memory traffic, the top-performing prefetch caches produce less memory traffic than the top-performing nonprefetch caches of the same size.
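The two mechanisms described in the abstract can be pictured with a short trace-driven sketch. The Python code below is a simplified, hypothetical illustration of a direct-mapped instruction cache that combines fall-through (next-line) prefetch with a target-prefetch prediction table; it is not the simulator used in the paper, and the line size, cache capacity, fetch-ahead distance, and single-successor prediction policy are illustrative assumptions.

```python
# A minimal sketch (assumptions noted below), not the paper's simulator:
# a direct-mapped instruction cache with hybrid prefetching that combines
# fall-through (next-line) prefetch with a target-prefetch prediction table.

LINE_SIZE = 32    # bytes per cache line (illustrative)
NUM_LINES = 256   # direct-mapped capacity in lines (illustrative)
FETCH_AHEAD = 8   # fall-through distance: prefetch the next line once the
                  # fetch address is within this many bytes of the line end

class PrefetchingICache:
    def __init__(self):
        self.lines = [None] * NUM_LINES  # index -> resident line address
        self.target_table = {}           # line address -> last nonsequential successor
        self.hits = self.misses = self.prefetches = 0

    def _resident(self, line_addr):
        return self.lines[line_addr % NUM_LINES] == line_addr

    def _install(self, line_addr):
        self.lines[line_addr % NUM_LINES] = line_addr

    def access(self, pc, prev_line=None):
        """Simulate one instruction fetch; returns the line it touched."""
        line_addr = pc // LINE_SIZE
        if self._resident(line_addr):
            self.hits += 1
        else:
            self.misses += 1
            self._install(line_addr)

        # Target prefetch: on a nonsequential line change, record the observed
        # successor of the previous line; on every access, prefetch whatever
        # successor the table predicts for the line we are in now.
        if prev_line is not None and line_addr not in (prev_line, prev_line + 1):
            self.target_table[prev_line] = line_addr
        predicted = self.target_table.get(line_addr)
        if predicted is not None and not self._resident(predicted):
            self._install(predicted)
            self.prefetches += 1

        # Fall-through prefetch: near the end of the current line, prefetch
        # the sequentially next line if it is not already resident.
        if LINE_SIZE - (pc % LINE_SIZE) <= FETCH_AHEAD and not self._resident(line_addr + 1):
            self._install(line_addr + 1)
            self.prefetches += 1

        return line_addr

if __name__ == "__main__":
    # Toy "trace": a 64-instruction loop body executed ten times; the backward
    # branch at the end of each iteration exercises the target prefetcher.
    trace = [0x1000 + 4 * i for i in range(64)] * 10
    cache = PrefetchingICache()
    prev = None
    for pc in trace:
        prev = cache.access(pc, prev)
    print(f"hits={cache.hits} misses={cache.misses} prefetches={cache.prefetches}")
```

A complete evaluation along the lines of the paper would also attach a memory latency to each fetch, so that the fraction of miss delay hidden by timely prefetches (the paper's "prefetch efficiency") and the resulting memory traffic could be measured; the sketch above only tracks hit, miss, and prefetch counts.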
