首页> 外国专利> GENERATING AND PROVIDING PROPOSED DIGITAL ACTIONS IN HIGH-DIMENSIONAL ACTION SPACES USING REINFORCEMENT LEARNING MODELS

GENERATING AND PROVIDING PROPOSED DIGITAL ACTIONS IN HIGH-DIMENSIONAL ACTION SPACES USING REINFORCEMENT LEARNING MODELS

机译:使用强化学习模型在高维动作空间中生成和提供建议的数字动作

摘要

The present disclosure relates to generating proposed digital actions in high-dimensional action spaces for client devices utilizing reinforcement learning models. For example, the disclosed systems can utilize a supervised machine learning model to train a latent representation decoder to determine proposed digital actions based on latent representations. Additionally, the disclosed systems can utilize a latent representation policy gradient model to train a state-based latent representation generation policy to generate latent representations based on the current state of client devices. Subsequently, the disclosed systems can identify the current state of a client device and a plurality of available actions, utilize the state-based latent representation generation policy to generate a latent representation based on the current state, and utilize the latent representation decoder to determine a proposed digital action from the plurality of available actions by analyzing the latent representation.
机译:本公开涉及利用增强学习模型在客户端设备的高维动作空间中生成提议的数字动作。例如,公开的系统可以利用监督机器学习模型来训练潜在表示解码器,以基于潜在表示来确定提议的数字动作。另外,所公开的系统可以利用潜在表示策略梯度模型来训练基于状态的潜在表示生成策略以基于客户端设备的当前状态来生成潜在表示。随后,所公开的系统可以识别客户端设备的当前状态和多个可用动作,利用基于状态的潜在表示生成策略来基于当前状态来产生潜在表示,并利用潜在表示解码器来确定通过分析潜在表示,从多个可用动作中提出数字动作。

著录项

  • 公开/公告号US2020241878A1

    专利类型

  • 公开/公告日2020-07-30

    原文格式PDF

  • 申请/专利权人 ADOBE INC.;

    申请/专利号US201916261092

  • 发明设计人 YASH CHANDAK;GEORGIOS THEOCHAROUS;

    申请日2019-01-29

  • 分类号G06F9/38;G06F9/48;G06N3/08;G06N20;

  • 国家 US

  • 入库时间 2022-08-21 11:22:01

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号