...
首页> 外文期刊>Journal of Parallel and Distributed Computing >Accurately modeling superscalar processor performance with reduced trace
【24h】

Accurately modeling superscalar processor performance with reduced trace

机译:精确建模超标量处理器性能,减少跟踪

获取原文
获取原文并翻译 | 示例
           

摘要

Trace-driven simulation of out-of-order superscalar processors is far from straightforward. The dynamic nature of out-of-order superscalar processors combined with the static nature of traces can lead to large inaccuracies in the results when the traces contain only a subset of executed instructions for trace reduction. In this paper, we describe and comprehensively evaluate the pairwise dependent cache miss model (PDCM), a framework for fast and accurate trace-driven simulation of out-of-order superscalar processors. The model determines how to treat a cache miss with respect to other cache misses recorded in the trace by dynamically reconstructing the reorder buffer state during simulation and honoring the dependencies between the trace items. Our experimental results demonstrate that a PDCM-based simulator produces highly accurate simulation results (less than 3% error) with fast simulation speeds (62.5× on average) compared with an execution-driven simulator. Moreover, we observed that the proposed simulation method is capable of preserving a processor's dynamic off-core memory access behavior and accurately predicting the relative performance change when a processor's low-level memory hierarchy parameters are changed.
机译:跟踪驱动的无序超标量处理器仿真远非那么简单。当跟踪仅包含用于减少跟踪的已执行指令的子集时,乱序的超标量处理器的动态特性与跟踪的静态特性相结合会导致结果中的大量误差。在本文中,我们描述并全面评估了成对相关的高速缓存未命中模型(PDCM),该模型可用于快速,准确地跟踪驱动的无序超标量处理器仿真。该模型通过在仿真过程中动态重构重排序缓冲区状态并遵循跟踪项之间的依赖性,来确定相对于跟踪中记录的其他缓存未命中如何处理缓存未命中。我们的实验结果表明,与基于执行驱动的仿真器相比,基于PDCM的仿真器可产生高精度的仿真结果(误差小于3%),仿真速度快(平均为62.5倍)。此外,我们观察到,所提出的仿真方法能够保留处理器的动态内核外内存访问行为,并能够准确预测当处理器的低级内存层次结构参数发生更改时的相对性能变化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号