...
首页> 外文期刊>Computer architecture news >Semantic Locality and Context-based Prefetching Using Reinforcement Learning
【24h】

Semantic Locality and Context-based Prefetching Using Reinforcement Learning

机译:使用强化学习的语义局部性和基于上下文的预取

获取原文
获取原文并翻译 | 示例
           

摘要

Most modern memory prefetchers rely on spatio-temporal locality to predict the memory addresses likely to be accessed by a program in the near future. Emerging workloads, however, make increasing use of irregular data structures, and thus exhibit a lower degree of spatial locality. This makes them less amenable to spatio-temporal prefetchers. In this paper, we introduce the concept of Semantic Locality, which uses inherent program semantics to characterize access relations. We show how, in principle, semantic locality can capture the relationship between data elements in a manner agnostic to the actual data layout, and we argue that semantic locality transcends spatio-temporal concerns. We further introduce the context-based memory prefetcher, which approximates semantic locality using reinforcement learning. The prefetcher identifies access patterns by applying reinforcement learning methods over machine and code attributes, that provide hints on memory access semantics. We test our prefetcher on a variety of benchmarks that employ both regular and irregular patterns. For the SPEC 2006 suite, it delivers speedups as high as 2.8x (20% on average) over a baseline with no prefetching, and outperforms leading spatio-temporal prefetchers. Finally, we show that the context-based prefetcher makes it possible for naive, pointer-based implementations of irregular algorithms to achieve performance comparable to that of spatially optimized code.
机译:大多数现代内存预取器都依赖时空局部性来预测不久的将来程序可能会访问的内存地址。但是,新兴的工作负载越来越多地使用不规则的数据结构,因此显示出较低的空间局部性。这使得它们不适合时空预取器。在本文中,我们介绍了语义局部性的概念,该概念使用固有程序语义来描述访问关系。我们从原则上说明了语义局部性如何以与实际数据布局不可知的方式捕获数据元素之间的关系,并且我们认为语义局部性超越了时空问题。我们进一步介绍了基于上下文的内存预取器,它使用强化学习来近似语义局部性。预取器通过对机器和代码属性应用强化学习方法来识别访问模式,从而提供有关内存访问语义的提示。我们在使用常规和不规则模式的各种基准上测试预取器。对于SPEC 2006套件,它在不进行预取的情况下,可以提供比基准高2.8倍(平均20%)的加速,并且性能优于主要的时空预取器。最后,我们证明了基于上下文的预取器使不规则算法的基于天真的,基于指针的实现有可能实现与空间优化代码相当的性能。

著录项

  • 来源
    《Computer architecture news》 |2015年第3期|285-297|共13页
  • 作者单位

    Electrical Engineering Technion - Israel Institute of Technology;

    Electrical Engineering Technion - Israel Institute of Technology;

    Electrical Engineering Technion - Israel Institute of Technology;

    Electrical Engineering Technion - Israel Institute of Technology,Computer Science Technion - Israel Institute of Technology;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号