首页> 外文会议>Performance Analysis of Systems and Software, 2009. ISPASS 2009 >Accurately approximating superscalar processor performance from traces
【24h】

Accurately approximating superscalar processor performance from traces

机译:从迹线精确逼近超标量处理器性能

获取原文

摘要

Trace-driven simulation of superscalar processors is particularly complicated. The dynamic nature of superscalar processors combined with the static nature of traces can lead to large inaccuracies in the results, especially when traces contain only a subset of executed instructions for trace reduction. The main problem in the filtered trace simulation is that the trace does not contain enough information with which one can predict the actual penalty of a cache miss. In this paper, we discuss and evaluate three strategies to quantify the impact of a long latency memory access in a superscalar processor when traces have only L1 cache misses. The strategies are based on models about how a cache miss is treated with respect to other cache misses: (1) isolated cache miss model, (2) independent cache miss model, and (3) pairwise dependent cache miss model. Our experimental results demonstrate that the pairwise dependent cache miss model produces reasonably accurate results (4.8% RMS error) under perfect branch prediction. Our work forms a basis for fast, accurate, and configurable multicore processor simulation using a pre-determined processor core design.
机译:跟踪驱动的超标量处理器仿真特别复杂。超标量处理器的动态特性与跟踪的静态特性相结合,会导致结果中的大量误差,特别是当跟踪仅包含一部分已执行指令以减少跟踪时,尤其如此。过滤的跟踪模拟中的主要问题是,跟踪没有包含足够的信息来预测高速缓存未命中的实际代价。在本文中,我们讨论和评估三种策略,以量化当跟踪只有L1高速缓存未命中时,超标量处理器中长等待时间存储器访问的影响。该策略基于关于如何相对于其他高速缓存未命中如何处理高速缓存未命中的模型:(1)孤立的高速缓存未命中模型,(2)独立的高速缓存未命中模型,以及(3)逐对相关的高速缓存未命中模型。我们的实验结果表明,成对依赖的缓存未命中模型在完美的分支预测下产生了相当准确的结果(4.8%RMS误差)。我们的工作为使用预定的处理器内核设计进行快速,准确和可配置的多核处理器仿真奠定了基础。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号