首页> 外文期刊>Computer architecture news >Accelerating Dependent Cache Misses with an Enhanced Memory Controller
【24h】

Accelerating Dependent Cache Misses with an Enhanced Memory Controller

机译:使用增强型内存控制器加速从属高速缓存未命中

获取原文
获取原文并翻译 | 示例
           

摘要

On-chip contention increases memory access latency for multi-core processors. We identify that this additional latency has a substantial effect on performance for an important class of latency-critical memory operations: those that result in a cache miss and are dependent on data from a prior cache miss. We observe that the number of instructions between the first cache miss and its dependent cache miss is usually small. To minimize dependent cache miss latency, we propose adding just enough functionality to dynamically identify these instructions at the core and migrate them to the memory controller for execution as soon as source data arrives from DRAM. This migration allows memory requests issued by our new Enhanced Memory Controller (EMC) to experience a 20% lower latency than if issued by the core. On a set of memory intensive quad-core workloads, the EMC results in a 13% improvement in system performance and a 5% reduction in energy consumption over a system with a Global History Buffer prefetcher, the highest performing prefetcher in our evaluation.
机译:片上竞争会增加多核处理器的内存访问延迟。我们发现,对于一类重要的对延迟至关重要的内存操作,这种额外的延迟会对性能产生重大影响:那些导致高速缓存未命中并依赖于先前的高速缓存未命中的数据的操作。我们观察到,第一个高速缓存未命中及其从属高速缓存未命中之间的指令数量通常很小。为了使相关的高速缓存未命中延迟最小化,我们建议添加足够的功能以在内核处动态识别这些指令,并在源数据从DRAM到达后立即将其迁移到内存控制器以执行。这种迁移使我们新的增强型内存控制器(EMC)发出的内存请求的延迟比内核发出的延迟低20%。在一组内存密集型四核工作负载上,EMC导致系统性能提高了13%,而与具有全球历史记录缓冲区预取器(我们评估中性能最高的预取器)的系统相比,能耗降低了5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号