METHOD FOR REINFORCEMENT LEARNING, RECORDING MEDIUM STORING REINFORCEMENT LEARNING PROGRAM, AND REINFORCEMENT LEARNING APPARATUS

Abstract

A computer-implemented method for reinforcement learning is disclosed. The method includes: predicting the state of a controlled target at each time point at which the state is measured, within the period from the time point at which a present action is determined to the time point at which the subsequent action is determined; calculating, from the prediction result, a degree of risk of the target's state at each time point with respect to a constraint condition; specifying a search range for the present action in accordance with the calculated degree of risk and the degree of impact of the present action on the target's state at each time point; and determining the present action based on the specified search range.
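
The four steps in the abstract can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: a scalar linear model x[k+1] = a*x[k] + b*u, a single upper-limit constraint, a constant prediction error bound, a risk measure defined as the remaining margin to the limit, and uniform sampling inside the search range. The function and parameter names are hypothetical.

```python
import random


def predict_states(x0, u, a, b, steps):
    """Predict the state at each measurement time point until the next action decision.

    Assumes (for illustration only) the scalar model x[k+1] = a*x[k] + b*u,
    with the action u held constant over the period.
    """
    states = []
    x = x0
    for _ in range(steps):
        x = a * x + b * u
        states.append(x)
    return states


def action_impacts(a, b, steps):
    """Degree of impact of the present action on each predicted state (dx[k]/du)."""
    impacts = []
    g = 0.0
    for _ in range(steps):
        g = a * g + b
        impacts.append(g)
    return impacts


def risk_margins(states, upper_limit, error_bound):
    """Margin left before the constraint x <= upper_limit is violated.

    A smaller margin means a higher degree of risk; a non-positive margin
    means the prediction (plus its error bound) already violates the limit.
    """
    return [upper_limit - (x + error_bound) for x in states]


def search_half_width(margins, impacts, max_half_width):
    """Narrow the search range as risk and impact grow."""
    widths = [max_half_width]
    for margin, impact in zip(margins, impacts):
        if margin <= 0:
            return 0.0  # no room to explore
        if impact != 0:
            widths.append(margin / abs(impact))
    return min(widths)


def choose_action(x0, nominal_u, a, b, steps, upper_limit, error_bound,
                  max_half_width, rng):
    """Pick the present action from a search range centred on nominal_u."""
    states = predict_states(x0, nominal_u, a, b, steps)
    margins = risk_margins(states, upper_limit, error_bound)
    impacts = action_impacts(a, b, steps)
    half_width = search_half_width(margins, impacts, max_half_width)
    action = rng.uniform(nominal_u - half_width, nominal_u + half_width)
    return action, half_width


if __name__ == "__main__":
    rng = random.Random(0)
    action, half_width = choose_action(
        x0=1.0, nominal_u=0.2, a=0.9, b=0.5, steps=3,
        upper_limit=2.0, error_bound=0.1, max_half_width=1.0, rng=rng,
    )
    print(f"search range half-width: {half_width:.4f}, action: {action:.4f}")
```

Under these assumptions, dividing each margin by the corresponding impact gives the largest deviation from the nominal action that keeps every predicted state within the limit, so taking the minimum over all time points yields a search range that respects the constraint throughout the period between action decisions.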

Bibliographic data

Publication number: US2021063974A1

Publication date: 2021-03-04

Original format: PDF
Applicant/assignee: FUJITSU LIMITED

Application number: US202017001706
Inventors: YOSHIHIRO OKAWA; TOMOTAKE SASAKI; HIDENAO IWANE; HITOSHI YANAMI

Filing date: 2020-08-25
Classification: G05B13/04; G06N5/02; G06N7; G05B17/02
Country: US
Indexed: 2022-08-24 17:30:04
