Conference: IEEE International Conference on Computer Design

Improving cache performance by combining cost-sensitivity and locality principles in cache replacement algorithms

Abstract

Due to the ever-increasing performance gap between the processor and main memory, it is crucial to bridge that gap by designing an efficient memory hierarchy that reduces the average memory access time. The cache replacement algorithm plays a central role in such a hierarchy. Many recent studies of cache replacement have focused on improving L2 cache replacement algorithms by minimizing the miss count. However, depending on the dependency chain, cache miss bursts, and other factors, a processor's ability to partially hide the cost of an L2 cache miss varies; that is, cache miss costs are not uniform. A better solution would therefore also account for the aggregate miss cost when designing cache replacement algorithms. Our proposed solution combines the two principles of locality and cost-sensitivity into one algorithm, which we call LACS (Locality-Aware Cost-Sensitive cache replacement algorithm). LACS estimates a cache block's cost from the number of instructions the processor manages to issue during a cache miss on that block, and then victimizes cache blocks with low cost and poor locality in order to maximize overall cache performance. When evaluated using a uniprocessor architecture model, LACS speeds up 10 L2 cache performance-constrained SPEC CPU2000 benchmarks by up to 85%, and by 15% on average, while not slowing down any of the 20 SPEC CPU2000 benchmarks evaluated. When evaluated using a dual-core CMP architecture model, LACS speeds up 6 SPEC CPU2000 benchmark pairs by up to 44%, and by 11% on average.
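
The victim-selection idea described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the abstract does not specify how locality and cost are combined, so the sketch makes several assumptions. It models one cache set kept in LRU order, treats the `window` blocks nearest the LRU end as having "poor locality", and assumes that a miss during which more instructions were issued was better hidden and is therefore cheaper. The names `Block`, `CacheSet`, `window`, and `issued_during_miss` are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Block:
    tag: int
    # Instructions issued while this block's last miss was outstanding.
    # Assumption: more issued instructions means a better-hidden, cheaper miss.
    issued_during_miss: int


@dataclass
class CacheSet:
    ways: int
    window: int = 2  # hypothetical: number of LRU-end blocks treated as "poor locality"
    blocks: list = field(default_factory=list)  # index 0 is LRU, last is MRU

    def access(self, tag, issued_during_miss=0):
        """Return True on a hit; on a miss, insert the block (evicting if full)."""
        for i, blk in enumerate(self.blocks):
            if blk.tag == tag:
                self.blocks.append(self.blocks.pop(i))  # promote to MRU
                return True
        if len(self.blocks) >= self.ways:
            self.blocks.remove(self.pick_victim())
        self.blocks.append(Block(tag, issued_during_miss))
        return False

    def pick_victim(self):
        """Among the poor-locality candidates, pick the lowest-cost block."""
        candidates = self.blocks[: self.window]
        # max() returns the first maximal element, so ties fall back to LRU order.
        return max(candidates, key=lambda b: b.issued_during_miss)


s = CacheSet(ways=4, window=2)
s.access(0xA, issued_during_miss=3)   # few instructions issued: costly miss
s.access(0xB, issued_during_miss=40)  # many instructions issued: cheap miss
s.access(0xC, issued_during_miss=10)
s.access(0xD, issued_during_miss=5)
s.access(0xE, issued_during_miss=7)   # set is full, so one block is evicted
print([hex(b.tag) for b in s.blocks])  # ['0xa', '0xc', '0xd', '0xe']
```

In this sketch plain LRU would evict block `0xa`, whereas the cost-aware choice keeps it and evicts `0xb`, whose miss was cheaper to re-incur under the stated assumption. Setting `window=1` reduces the policy to plain LRU.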
