首页> 外国专利> METHOD FOR REINFORCEMENT LEARNING, RECORDING MEDIUM STORING REINFORCEMENT LEARNING PROGRAM, AND REINFORCEMENT LEARNING APPARATUS

METHOD FOR REINFORCEMENT LEARNING, RECORDING MEDIUM STORING REINFORCEMENT LEARNING PROGRAM, AND REINFORCEMENT LEARNING APPARATUS

机译:增强学习方法,记录介质存储加强学习程序,以及加强学习设备

摘要

A method for reinforcement learning performed by a computer is disclosed. The method includes: predicting a state of a target to be controlled in reinforcement learning at each time point to measure a state of the target, the time point being included in a period from a time point to determine a present action to a time point to determine a subsequent action; calculating a degree of risk concerning the state of the target at the each time point with respect to a constraint condition based on a result of prediction; specifying a search range concerning the present action to the target in accordance with the calculated degree of risk and a degree of impact of the present action to the target on the state of the target at the each time point; and determining the present action to the target based on the specified search range.
机译:公开了一种由计算机执行的增强学习方法。该方法包括:预测在每个时间点在每个时间点测量目标的加强学学习的状态,以测量目标的状态,所包括的时间点在从时间点确定当前动作到时间点的时间点确定随后的行动;基于预测结果计算关于每个时间点的目标的危险程度;根据计算的风险程度和当前动作对每个时间点的靶状态的目标的风险程度和对目标的影响程度指定关于目标的搜索范围;并基于指定的搜索范围确定对目标的当前动作。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号