首页> 外国专利> How to build a trained model and a design support device using the trained model

How to build a trained model and a design support device using the trained model

机译：如何使用训练过的模型构建训练过的模型和设计支持设备

页面导航

摘要
著录项
相似文献

摘要

PROBLEM TO BE SOLVED: To facilitate setting of a reward in a method of constructing a trained model using reinforcement learning and a design support device using the trained model. SOLUTION: A reward R is given to a combination of a state S determined depending on whether or not a design activity is executed and an action A which is an activity selectable under the state, and the value is maximized. This is a method of constructing a trained model in which the trained model 50 is constructed, that is, step ST4 in which a reward is given based on a pre-input rule 16, and step ST5 in which reinforcement learning is performed based on the given reward. Including, the rule contains information on the source activity, the route destination activity to be performed later, and the thickness indicating the importance of the route destination activity to be performed after the route source activity, and in the step of awarding the reward. When the action immediately before reaching the state matches the route source activity and the action and the route destination activity match, the reward is set based on the thickness. [Selection diagram] FIG. 13

机译：需要解决的问题：通过使用强化学习构建训练模型的方法和使用训练模型的设计支持设备，方便设置奖励。解决方案：根据设计活动是否执行而确定的状态S和作为该状态下可选择的活动的动作A的组合会得到奖励R，并且该值会最大化。这是一种构造训练模型的方法，其中构造了训练模型50，即步骤ST4，其中基于预输入规则16给予奖励，步骤ST5，其中基于给定奖励执行强化学习。包括，该规则包含关于源活动、稍后要执行的路由目的地活动、指示在路由源活动之后以及在奖励步骤中要执行的路由目的地活动的重要性的厚度的信息。当到达该状态前的动作与路由源活动匹配，且动作与路由目的地活动匹配时，奖励将基于厚度设置。[选择图]图13

著录项

公开/公告号JP2022056238A

专利类型
公开/公告日2022-04-08

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号JP20200164148
发明设计人安原重人;岡田英之;吉本毅;岡田高幸;後藤禎;福田昇;山田航司;
展开▼

申请日2020-09-29
分类号G06F30/27;G06F30/10;G06N20;
国家 JP
入库时间 2022-08-25 00:29:43

相似文献

专利
外文文献
中文文献