Branch prediction, instruction-window size, and cache size: performance trade-offs and simulation techniques

Skadron K.; Ahuja P.S.

首页> 外文期刊>IEEE Transactions on Computers >Branch prediction, instruction-window size, and cache size: performance trade-offs and simulation techniques

【24h】

Branch prediction, instruction-window size, and cache size: performance trade-offs and simulation techniques

机译：分支预测，指令窗口大小和高速缓存大小：性能折衷和仿真技术

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Design parameters interact in complex ways in modern processors, especially because out-of-order issue and decoupling buffers allow latencies to be overlapped. Trade-offs among instruction-window size, branch-prediction accuracy, and instruction- and data-cache size can change as these parameters move through different domains. For example, modeling unrealistic caches can under- or overstate the benefits of better prediction or a larger instruction window. Avoiding such pitfalls requires understanding how all these parameters interact. Because such methodological mistakes are common, this paper provides a comprehensive set of SimpleScalar simulation results from SPECint95 programs, showing the interactions among these major structures. In addition to presenting this database of simulation results, major mechanisms driving the observed trade-offs are described. The paper also considers appropriate simulation techniques when sampling full-length runs with the SPEC reference inputs. In particular, the results show that branch mispredictions limit the benefits of larger instruction windows, that better branch prediction and better instruction cache behavior have synergistic effects, and that the benefits of larger instruction windows and larger data caches trade off and have overlapping effects. In addition, simulations of only 50 million instructions can yield representative results if these short windows are carefully selected.

机译：设计参数在现代处理器中以复杂的方式进行交互，尤其是因为乱序问题和去耦缓冲区使等待时间重叠。随着这些参数在不同域中的移动，指令窗口大小，分支预测精度以及指令和数据高速缓存大小之间的权衡可能会发生变化。例如，对不现实的缓存建模可能会低估或夸大更好的预测或更大的指令窗口的好处。避免此类陷阱需要了解所有这些参数如何相互作用。由于此类方法错误很常见，因此本文提供了来自SPECint95程序的一整套SimpleScalar仿真结果，显示了这些主要结构之间的相互作用。除了提供此模拟结果数据库之外，还描述了驱动观察到的折衷的主要机制。当使用SPEC参考输入进行全长运行采样时，本文还考虑了适当的仿真技术。尤其是，结果表明，分支错误预测限制了较大的指令窗口的好处，更好的分支预测和更好的指令缓存行为具有协同作用，并且较大的指令窗口和更大的数据缓存的好处之间存在权衡并具有重叠的作用。此外，如果精心选择这些短窗口，则仅5000万条指令的模拟就可以产生具有代表性的结果。

著录项

来源
《IEEE Transactions on Computers》 |1999年第11期|P.1260-1281|共22页
作者
Skadron K.; Ahuja P.S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Writing Efficient, Effective Webcode For Domino, Part 3 techniques For Limiting Http Requests, Reducing File Sizes, And Caching [J] . Joacim Boive THE VIEW . 2008,第6期

机译：为Domino编写高效的Web代码，第3部分，用于限制Http请求，减小文件大小和缓存的技术
2. Benefits of small-sized caches for scatter-hoarding rodents: Influence of cache size, depth, and soil moisture [J] . Geluso K Journal of Mammalogy . 2005,第6期

机译：小型缓存器对散布ho鼠的好处：缓存器大小，深度和土壤湿度的影响
3. In silico predictions of LH2 ring sizes from the crystal structure of a single subunit using molecular dynamics simulations. [J] . Janosi L, Keer H, Cogdell RJ, Proteins: Structure, Function, and Genetics . 2011,第7期

机译：使用分子动力学模拟从单个亚基的晶体结构对LH2环大小进行计算机预测。
4. Patch size setup and performance/cost trade-offs in multi-objective antenna optimization using domain patching technique [C] . Slawomir Koziel, Adrian Bekasiewicz, Qingsha S. Cheng 2016 IEEE MTT-S International Conference on Numerical Electromagnetic and Multiphysics Modeling and Optimization . 2016

机译：使用域修补技术的多目标天线优化中的修补尺寸设置和性能/成本折衷
5. Performance analysis of processor cache memory with adaptive line size. [D] . Hernandez Tapia, Jesus. 2009

机译：具有自适应行大小的处理器缓存的性能分析。
6. Important considerations for protein analyses using antibody based techniques: down-sizing Western blotting up-sizes outcomes [O] . Robyn M Murphy, Graham D Lamb 2013

机译：使用基于抗体的技术进行蛋白质分析的重要注意事项：缩小Western印迹的尺寸以扩大结果
7. Branch prediction, instruction-window size, and cache size: Performance tradeoffs and simulation techniques [O] . Kevin Skadron, Pritpal S. Ahuja, Margaret Martonosi, 1999

机译：分支预测，指令窗口大小和缓存大小：性能折衷和仿真技术
8. Performance Predictions for an Intermediate-Sized VAWT (Vertical Axis Wind Turbine) Based on Performance of the 34-M VAWT Test Bed. [R] . Dodd, H. M. 1989

机译：基于34-m VaWT试验台性能的中型VaWT（垂直轴风力发电机）性能预测。

Branch prediction, instruction-window size, and cache size: performance trade-offs and simulation techniques

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅