Memory Prefetching Using Adaptive Stream Detection

Abstract

We present Adaptive Stream Detection, a simple technique for modulating the aggressiveness of a stream prefetcher to match a workload's observed spatial locality. We use this concept to design a prefetcher that resides on an on-chip memory controller. The result is a prefetcher with small hardware costs that can exploit workloads with low amounts of spatial locality. Using highly accurate simulators for the IBM Power5+, we show that this prefetcher improves performance of the SPEC2006fp benchmarks by an average of 32.7% when compared against a Power5+ that performs no prefetching. On a set of 5 commercial benchmarks that have low spatial locality, this prefetcher improves performance by an average of 15.1%. When compared against a typical Power5+ that does perform processor-side prefetching, the average performance improvement for these two benchmark suites is 10.2% and 8.4%, respectively. We also evaluate the power and energy impact of our technique. For the same benchmark suites, DRAM power consumption increases by less than 3%, while energy usage decreases by 9.8% and 8.2%, respectively. Moreover, the power consumption of the prefetcher itself is low; it is estimated to increase the power consumption of the Power5+ chip by 0.06%.
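The abstract does not spell out the mechanism, so the following is only a toy sketch of what "modulating the aggressiveness of a stream prefetcher to match observed spatial locality" could look like. Everything in it is an assumption made for illustration: the class name `AdaptiveStreamSketch`, the use of a histogram of finished stream lengths, the single tracked stream, and the rule "prefetch the next line only if more past streams continued beyond the current length than ended at it". It is not the Power5+ design or the authors' actual hardware.

```python
from collections import Counter


class AdaptiveStreamSketch:
    """Toy model (hypothetical): learn how long streams of consecutive
    cache lines usually are, and prefetch only while a stream is likely
    to continue."""

    def __init__(self):
        self.hist = Counter()   # hist[k] = number of finished streams of length k
        self.last_line = None   # most recently accessed cache-line number
        self.cur_len = 0        # length of the stream currently being tracked

    def _should_prefetch(self):
        k = self.cur_len
        ended_here = self.hist[k]
        went_longer = sum(n for length, n in self.hist.items() if length > k)
        # Assumed rule: prefetch only if history says a stream that has
        # reached length k more often continues than stops.
        return went_longer > ended_here

    def access(self, line):
        """Record a read of cache line `line`.

        Returns the line number to prefetch, or None for no prefetch."""
        if self.last_line is not None and line == self.last_line + 1:
            self.cur_len += 1                      # stream continues
        else:
            if self.cur_len > 0:
                self.hist[self.cur_len] += 1       # previous stream ended
            self.cur_len = 1                       # a new stream starts
        self.last_line = line
        return line + 1 if self._should_prefetch() else None


if __name__ == "__main__":
    p = AdaptiveStreamSketch()
    # A workload with low spatial locality: every stream is two lines long.
    for line in [0, 1, 10, 11, 20, 21]:
        print(line, "->", p.access(line))
```

In this sketch, once one two-line stream has been observed, the model prefetches the second line of each new stream and then stops, rather than running ahead as a fixed-depth stream prefetcher would. That is the sense in which such a scheme could still benefit workloads with little spatial locality without wasting memory bandwidth.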
