首页> 外文学位 >Integrating advice with reinforcement learning.
【24h】

Integrating advice with reinforcement learning.

机译:将建议与强化学习相结合。

获取原文
获取原文并翻译 | 示例

摘要

Reinforcement learning has proven to be an effective method for creating intelligent agents in a wide range of applications. However, it suffers from the need for a large number of training episodes, a problem that is especially noticeable in tasks that need to be learned on-line. Reinforcement learning further deteriorates in tasks that generate insufficient feedback and/or have long inter-reinforcement times.; We extend the traditional reinforcement learning approach with an external teacher that provides additional instruction to the learning agent. The agent incorporates both the reinforcements and the external instruction to obtain a combined policy that is correct with respect to the task and benefits from the teacher's advice. The learning agent converts the instructions to an extended or user reward function that, together with the task reward function, defines a composite reward function that more accurately defines the teacher's perception of the task.
机译:事实证明,强化学习是在各种应用中创建智能代理的有效方法。但是,它需要大量的训练时间,这在需要在线学习的任务中尤为明显。强化学习在产生反馈不足和/或长时间强化时间的任务中进一步恶化。我们通过外部老师扩展了传统的强化学习方法,该老师为学习代理提供了额外的指导。代理人结合了增援和外部指导,以得到一项结合任务正确的政策,并从老师的建议中受益。学习代理将指令转换为扩展或用户奖励功能,该功能与任务奖励功能一起定义了复合奖励功能,该复合奖励功能更准确地定义了教师对任务的理解。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号