首页> 外国专利> Using Reinforcement Learning to Dynamically Tune Cache Policy Parameters

Using Reinforcement Learning to Dynamically Tune Cache Policy Parameters

机译:使用钢筋学习动态调整缓存策略参数

摘要

Reinforcement learning is used to dynamically tune cache policy parameters. The current state of a workload on a cache is provided to a reinforcement learning process. The reinforcement learning process uses the cache workload characterization to select an action to be taken to adjust a value of one of multiple parameterized cache policies used to control operation of a cache. The adjusted value is applied to the cache for an upcoming time interval. At the end of the time interval, a reward associated with the action is determined, which may be computed by comparing the cache hit rate during the interval with a baseline hit rate. The process iterates until the end of an episode, at which point the parameters of the cache control policies are reset. The episode is used to train the reinforcement learning policy so that the reinforcement learning process converges to a trained state.
机译:强化学习用于动态调整缓存策略参数。 将高速缓存上的工作负载状态提供给加强学习过程。 加强学习过程使用缓存工作负载表征来选择要采取的动作来调整用于控制缓存操作的多个参数化缓存策略之一的值。 调整后的值被应用于高速缓存以获取即将到来的时间间隔。 在时间间隔的末尾,确定与动作相关联的奖励,可以通过将缓存命中率与基线命中率的间隔进行比较来计算。 该过程迭代直到剧集的末尾,此时重置高速缓存控制策略的参数。 该集中用于训练加强学习政策,使得加强学习过程会聚到训练状态。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号