首页> 外文期刊>IEEE Transactions on Computers >Branch prediction, instruction-window size, and cache size: performance trade-offs and simulation techniques
【24h】

Branch prediction, instruction-window size, and cache size: performance trade-offs and simulation techniques

机译:分支预测,指令窗口大小和高速缓存大小:性能折衷和仿真技术

获取原文
获取原文并翻译 | 示例

摘要

Design parameters interact in complex ways in modern processors, especially because out-of-order issue and decoupling buffers allow latencies to be overlapped. Trade-offs among instruction-window size, branch-prediction accuracy, and instruction- and data-cache size can change as these parameters move through different domains. For example, modeling unrealistic caches can under- or overstate the benefits of better prediction or a larger instruction window. Avoiding such pitfalls requires understanding how all these parameters interact. Because such methodological mistakes are common, this paper provides a comprehensive set of SimpleScalar simulation results from SPECint95 programs, showing the interactions among these major structures. In addition to presenting this database of simulation results, major mechanisms driving the observed trade-offs are described. The paper also considers appropriate simulation techniques when sampling full-length runs with the SPEC reference inputs. In particular, the results show that branch mispredictions limit the benefits of larger instruction windows, that better branch prediction and better instruction cache behavior have synergistic effects, and that the benefits of larger instruction windows and larger data caches trade off and have overlapping effects. In addition, simulations of only 50 million instructions can yield representative results if these short windows are carefully selected.
机译:设计参数在现代处理器中以复杂的方式进行交互,尤其是因为乱序问题和去耦缓冲区使等待时间重叠。随着这些参数在不同域中的移动,指令窗口大小,分支预测精度以及指令和数据高速缓存大小之间的权衡可能会发生变化。例如,对不现实的缓存建模可能会低估或夸大更好的预测或更大的指令窗口的好处。避免此类陷阱需要了解所有这些参数如何相互作用。由于此类方法错误很常见,因此本文提供了来自SPECint95程序的一整套SimpleScalar仿真结果,显示了这些主要结构之间的相互作用。除了提供此模拟结果数据库之外,还描述了驱动观察到的折衷的主要机制。当使用SPEC参考输入进行全长运行采样时,本文还考虑了适当的仿真技术。尤其是,结果表明,分支错误预测限制了较大的指令窗口的好处,更好的分支预测和更好的指令缓存行为具有协同作用,并且较大的指令窗口和更大的数据缓存的好处之间存在权衡并具有重叠的作用。此外,如果精心选择这些短窗口,则仅5000万条指令的模拟就可以产生具有代表性的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号