首页> 外文会议>IEEE Pacific Rim Conference on Communications, Computers and Signal Processing >A mechanistic model of memory level parallelism fed with cache miss rates
【24h】

A mechanistic model of memory level parallelism fed with cache miss rates

机译:通过高速缓存未命运率提供的记忆级并行性机械模型

获取原文

摘要

Non-blocking caches, which are commonly utilized in modern out-of-order processors, could handle multiple outstanding memory requests simultaneously to reduce the penalties of long latency cache misses. Memory level parallelism (MLP), which refers to the number of memory requests concurrently held by Miss Status Handling Registers (MSHRs), is an indispensable factor to estimate cache performance. To achieve MLP efficiently, previous researches oversimplified the factors that need to be considered when constructing analytical models, especially for the influences of cache miss rate. By quantifying above cache miss rate effects, this paper proposes a mechanistic model of memory level parallelism, which performs more accurate than existing works. 15 benchmarks, chosen from Mobybench 2.0, Mibench 1.0 and MediaBench II, are adopted for evaluating the accuracy of our model. Compared to Gem5 cycle-accurate simulation results, the largest root mean square error is less than 11%, while the average one is around 7%. Meanwhile, the cache performance forecasting process can be sped up about 38 times compared to the Gem5 cycle-accurate simulations.
机译:通常用于现代处理器的非阻塞缓存,可以同时处理多个未完成的内存请求以减少长期缓存未命中的惩罚。内存级并行性(MLP)是指Miss STATUS处理寄存器同时保持的内存请求数(MSHRS),是估计高速缓存性能的不可或缺的因素。为了有效地实现MLP,之前的研究过度简化了构建分析模型时需要考虑的因素,特别是对于缓存未命中率的影响。通过量化上述缓存未命中率效应,本文提出了内存级并行性的机制模型,其比现有工作更准确。从MobyBench 2.0中选择的15个基准,采用Mibench 1.0和MediaBench II来评估模型的准确性。与GEM5循环准确的仿真结果相比,最大的根均方误差小于11 %,而平均值约为7 %。同时,与GEM5循环准确模拟相比,缓存性能预测过程可以增加大约38次。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号