首页>
外国专利>
METHOD FOR REINFORCEMENT LEARNING, RECORDING MEDIUM STORING REINFORCEMENT LEARNING PROGRAM, AND REINFORCEMENT LEARNING APPARATUS
METHOD FOR REINFORCEMENT LEARNING, RECORDING MEDIUM STORING REINFORCEMENT LEARNING PROGRAM, AND REINFORCEMENT LEARNING APPARATUS
展开▼
机译:增强学习方法,记录介质存储加强学习程序,以及加强学习设备
展开▼
页面导航
摘要
著录项
相似文献
摘要
A method for reinforcement learning performed by a computer is disclosed. The method includes: predicting a state of a target to be controlled in reinforcement learning at each time point to measure a state of the target, the time point being included in a period from a time point to determine a present action to a time point to determine a subsequent action; calculating a degree of risk concerning the state of the target at the each time point with respect to a constraint condition based on a result of prediction; specifying a search range concerning the present action to the target in accordance with the calculated degree of risk and a degree of impact of the present action to the target on the state of the target at the each time point; and determining the present action to the target based on the specified search range.
展开▼