首页> 外文会议>Annual IEEE/ACM International Symposium on Microarchitecture >SHIFT: Shared history instruction fetch for lean-core server processors
【24h】

SHIFT: Shared history instruction fetch for lean-core server processors

机译:shift:共享历史记录指令获取leir-core服务器处理器

获取原文

摘要

In server workloads, large instruction working sets result in high L1 instruction cache miss rates. Fast access requirements preclude large instruction caches that can accommodate the deep software stacks prevalent in server applications. Prefetching has been a promising approach to mitigate instruction-fetch stalls by relying on recurring instruction streams of server workloads to predict future instruction misses. By recording and replaying instruction streams from dedicated storage next to each core, stream-based prefetchers have been shown to overcome instruction fetch stalls. Problematically, existing stream-based prefetchers incur high history storage costs resulting from large instruction working sets and complex control flow inherent in server workloads. The high storage requirements of these prefetchers prohibit their use in emerging lean-core server processors. We introduce Shared History Instruction Fetch, SHIFT, an instruction prefetcher suitable for lean-core server processors. By sharing the history across cores, SHIFT minimizes the cost per core without sacrificing miss coverage. Moreover, by embedding the shared instruction history in the LLC, SHIFT obviates the need for dedicated instruction history storage, while transparently enabling multiple instruction histories in the presence of workload consolidation. In a 16-core server CMP, SHIFT eliminates 81% (up to 93%) of instruction cache misses, achieving 19% (up to 42%) speedup on average. SHIFT captures 90% of the performance benefit of the state-of-the-art instruction prefetcher at 14× less storage cost.
机译:在服务器工作负载中,大的指令工作集会导致高L1指令缓存未命中率。快速访问要求排除了可以容纳服务器应用程序中普遍存在的深软件堆栈的大型指令缓存。预取通过依赖于服务器工作负载的重复指令流来预测指令获取档位是一种有希望的方法来预测未来的指令未命中。通过在每个核心旁边的专用存储中记录和重放指令流,已显示基于流的预取器来克服指令获取档位。有问题地,现有的基于流的预取器产生高历史存储成本,从大型指令工作集和服务器工作负载中固有的复杂控制流程产生。这些预取器的高存储要求禁止其在新兴瘦核服务器处理器中使用。我们介绍共享历史记录指令提取,Shift,适用于瘦核服务器处理器的指令预取器。通过分享核心的历史,因此移动最小化每核的成本,而不会牺牲错过覆盖范围。此外,通过在LLC中嵌入共享指令历史,Shift避免了对专用指令历史存储的需要,同时透明地在存在工作负载整合中启用多个指令历史。在16核服务器CMP中,Shift消除了81%(高达93%)的指令高速缓存未命中,平均实现了19%(高达42%)的加速。转变捕获最先进的指令预取符合14倍的储存成本的90%的性能优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号