首页> 外国专利> Using Reinforcement Learning to Dynamically Tune Cache Policy Parameters

Using Reinforcement Learning to Dynamically Tune Cache Policy Parameters

机译：使用钢筋学习动态调整缓存策略参数

页面导航

摘要
著录项
相似文献

摘要

Reinforcement learning is used to dynamically tune cache policy parameters. The current state of a workload on a cache is provided to a reinforcement learning process. The reinforcement learning process uses the cache workload characterization to select an action to be taken to adjust a value of one of multiple parameterized cache policies used to control operation of a cache. The adjusted value is applied to the cache for an upcoming time interval. At the end of the time interval, a reward associated with the action is determined, which may be computed by comparing the cache hit rate during the interval with a baseline hit rate. The process iterates until the end of an episode, at which point the parameters of the cache control policies are reset. The episode is used to train the reinforcement learning policy so that the reinforcement learning process converges to a trained state.

机译：强化学习用于动态调整缓存策略参数。将高速缓存上的工作负载状态提供给加强学习过程。加强学习过程使用缓存工作负载表征来选择要采取的动作来调整用于控制缓存操作的多个参数化缓存策略之一的值。调整后的值被应用于高速缓存以获取即将到来的时间间隔。在时间间隔的末尾，确定与动作相关联的奖励，可以通过将缓存命中率与基线命中率的间隔进行比较来计算。该过程迭代直到剧集的末尾，此时重置高速缓存控制策略的参数。该集中用于训练加强学习政策，使得加强学习过程会聚到训练状态。

著录项

公开/公告号US2021374523A1

专利类型
公开/公告日2021-12-02

原文格式PDF
申请/专利权人 EMC IP HOLDING COMPANY LLC;
展开▼

申请/专利号US202016889104
发明设计人 VINICIUS MICHEL GOTTIN;TIAGO SALVIANO CALMON;JONAS FURTADO DIAS;ALEX LAIER BORDIGNON;DANIEL SADOC MENASCHÉ;
展开▼

申请日2020-06-01
分类号G06N3/08;G06F16/22;G06F16/2455;G06N3/04;
国家 US
入库时间 2022-08-24 22:34:18

相似文献

专利
外文文献
中文文献