RECORDING MEDIUM THAT STORES REINFORCEMENT LEARNING PROGRAM, REINFORCEMENT LEARNING METHOD, AND REINFORCEMENT LEARNING APPARATUS

Abstract

A reinforcement learning method is performed by a computer. The method includes: acquiring an input value related to a state and an action of a control target, together with the gain of the control target that corresponds to the input value; estimating, based on the acquired input value and the gain, the coefficients of a state-action value function that is a polynomial in a variable representing the action of the control target, or that becomes a polynomial in that variable when a value is substituted for a variable representing the state of the control target; and obtaining an optimum action, or an optimum value of the state-action value function with the estimated coefficients, by using quantifier elimination.
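The three steps in the abstract (acquire samples, estimate polynomial coefficients, optimize over the action) can be illustrated with a minimal Python sketch. It is not the patented procedure, and it rests on several assumptions that are not in the source: the state and action are scalars, each observed gain is treated as a direct sample of the state-action value (a one-step simplification), the coefficients are fitted by ordinary least squares, and quantifier elimination is replaced by a plain critical-point search over a bounded action interval. The names `fit_q_coefficients` and `best_action` are hypothetical.

```python
import numpy as np


def fit_q_coefficients(states, actions, gains, degree=2):
    """Least-squares fit of Q(s, a) = sum of c[i, j] * s**i * a**j, i + j <= degree.

    Assumption: each gain is used directly as a sample of Q(s, a).
    """
    exponents = [(i, j) for i in range(degree + 1) for j in range(degree + 1 - i)]
    features = np.column_stack([states**i * actions**j for i, j in exponents])
    coeffs, *_ = np.linalg.lstsq(features, gains, rcond=None)
    return dict(zip(exponents, coeffs))


def best_action(coeffs, state, a_min, a_max):
    """Substitute the state, then maximize the resulting polynomial in a.

    Stand-in for quantifier elimination: compare the interval endpoints
    with the real roots of dQ/da that lie inside [a_min, a_max].
    """
    degree_a = max(j for _, j in coeffs)
    poly_in_a = np.zeros(degree_a + 1)  # poly_in_a[j] multiplies a**j
    for (i, j), c in coeffs.items():
        poly_in_a[j] += c * state**i
    poly = np.polynomial.Polynomial(poly_in_a)

    candidates = [a_min, a_max]
    for root in poly.deriv().roots():
        if abs(root.imag) < 1e-9 and a_min <= root.real <= a_max:
            candidates.append(float(root.real))

    values = [float(poly(a)) for a in candidates]
    k = int(np.argmax(values))
    return candidates[k], values[k]


# Synthetic data, made up for illustration: gain = 1 - (a - s)**2
rng = np.random.default_rng(0)
s = rng.uniform(-1.0, 1.0, 200)
a = rng.uniform(-1.0, 1.0, 200)
coeffs = fit_q_coefficients(s, a, 1.0 - (a - s) ** 2)
action, value = best_action(coeffs, state=0.5, a_min=-1.0, a_max=1.0)
print(action, value)
```

On this noise-free synthetic data the fitted polynomial matches the generating function, so the returned action should sit close to the state value 0.5. A quantifier elimination tool would instead return an exact symbolic description of the optimum, which is the part of the method this sketch does not reproduce.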

Bibliographic data

Publication number: US2020184277A1

Patent type:
Publication date: 2020-06-11

Original format: PDF
Applicant/patentee: FUJITSU LIMITED

Application number: US201916702676
Inventors: HIDENAO IWANE; TOMOTAKE SASAKI; HITOSHI YANAMI

Filing date: 2019-12-04
Classification: G06K9/62; G05B13/02
Country: US
Date indexed: 2022-08-21 11:27:33
