首页> 外文会议> >The performance of runtime data cache prefetching in a dynamic optimization system
【24h】

The performance of runtime data cache prefetching in a dynamic optimization system

机译:动态优化系统中运行时数据高速缓存预取的性能

获取原文

摘要

Traditional software controlled data cache prefetching is often ineffective due to the lack of runtime cache miss and miss address information. To overcome this limitation, we implement runtime data cache prefetching in the dynamic optimization system ADORE (ADaptive Object code Reoptimization). Its performance has been compared with static software prefetching on the SPEC2000 benchmark suite. Runtime cache prefetching shows better performance. On an Itanium 2 based Linux workstation, it can increase performance by more than 20% over static prefetching on some benchmarks. For benchmarks that do not benefit from prefetching, the runtime optimization system adds only 1%-2% overhead. We have also collected cache miss profiles to guide static data cache prefetching in the ORC compiler. With that information the compiler can effectively avoid generating prefetches for loops that hit well in the data cache.
机译:由于缺少运行时高速缓存未命中和未命中地址信息,传统的软件控制的数据高速缓存预取通常无效。为了克服此限制,我们在动态优化系统ADORE(自适应目标代码重新优化)中实现了运行时数据缓存预取。它的性能已与SPEC2000基准测试套件上的静态软件预取进行了比较。运行时缓存预取显示了更好的性能。在基于Itanium 2的Linux工作站上,与某些基准测试上的静态预取相比,它可以将性能提高20%以上。对于无法从预取中受益的基准,运行时优化系统仅增加1%-2%的开销。我们还收集了高速缓存未命中配置文件,以指导在ORC编译器中进行静态数据高速缓存预取。利用这些信息,编译器可以有效地避免为在数据高速缓存中命中的循环生成预取。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号