Journal: Operating Systems Review

Compiler Orchestrated Prefetching via Speculation and Predication

Abstract

This paper introduces a compiler-orchestrated prefetching system as a unified framework geared toward ameliorating the gap between processing speeds and memory access latencies. We focus the scope of the optimization on specific subsets of the program dependence graph that succinctly characterize the memory access pattern of both regular array-based applications and irregular pointer-intensive programs. We illustrate how program embedded precomputation via speculative execution can accurately predict and effectively prefetch future memory references with negligible overhead. The proposed techniques reduce the total running time of seven SPEC benchmarks and two OLDEN benchmarks by 27% on an Itanium 2 processor. The improvements are in addition to several state-of-the-art optimizations including software pipelining and data prefetching. In addition, we use cycle-accurate simulations to identify important and lightweight architectural innovations that further mitigate the memory system bottleneck. In particular, we focus on the notoriously challenging class of pointer-chasing applications, and demonstrate how they may benefit from a novel scheme of sentineled prefetching. Our results for twelve SPEC benchmarks demonstrate that 45% of the processor stalls that are caused by the memory system are avoidable. The techniques in this paper can effectively mask long memory latencies with little instruction overhead, and can readily contribute to the performance of processors today.
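As a rough illustration of why pointer-chasing loops are hard to prefetch for, and of what a software prefetch looks like at the source level, here is a minimal sketch. It is not the paper's compiler transformation and not its sentineled prefetching scheme; it only shows a hand-written "prefetch the next node" hint. It assumes a GCC or Clang toolchain that provides `__builtin_prefetch`, and the `node` type and `sum_with_prefetch` function are hypothetical names introduced for this example.

```c
#include <stddef.h>

/* Hypothetical list node, used only for this illustration. */
struct node {
    long value;
    struct node *next;
};

/*
 * Sum a singly linked list, hinting the next node into cache before
 * doing the work for the current one. The address of the next node is
 * only known after the current node has been loaded, which is what
 * limits how far ahead a simple hint like this can run.
 */
long sum_with_prefetch(const struct node *head)
{
    long total = 0;
    for (const struct node *p = head; p != NULL; p = p->next) {
        /* Assumption: GCC/Clang builtin. rw = 0 (read), locality = 1.
         * A prefetch of a NULL or invalid address does not fault. */
        __builtin_prefetch(p->next, 0, 1);
        total += p->value;
    }
    return total;
}
```

The hint changes only timing, never the result, so the function returns the same sum with or without it. Whether it helps depends on how much work each iteration does relative to the memory latency.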
