首页> 外国专利> How to build a trained model and a design support device using the trained model

How to build a trained model and a design support device using the trained model

机译:如何使用训练过的模型构建训练过的模型和设计支持设备

摘要

PROBLEM TO BE SOLVED: To facilitate setting of a reward in a method of constructing a trained model using reinforcement learning and a design support device using the trained model. SOLUTION: A reward R is given to a combination of a state S determined depending on whether or not a design activity is executed and an action A which is an activity selectable under the state, and the value is maximized. This is a method of constructing a trained model in which the trained model 50 is constructed, that is, step ST4 in which a reward is given based on a pre-input rule 16, and step ST5 in which reinforcement learning is performed based on the given reward. Including, the rule contains information on the source activity, the route destination activity to be performed later, and the thickness indicating the importance of the route destination activity to be performed after the route source activity, and in the step of awarding the reward. When the action immediately before reaching the state matches the route source activity and the action and the route destination activity match, the reward is set based on the thickness. [Selection diagram] FIG. 13
机译:需要解决的问题:通过使用强化学习构建训练模型的方法和使用训练模型的设计支持设备,方便设置奖励。解决方案:根据设计活动是否执行而确定的状态S和作为该状态下可选择的活动的动作A的组合会得到奖励R,并且该值会最大化。这是一种构造训练模型的方法,其中构造了训练模型50,即步骤ST4,其中基于预输入规则16给予奖励,步骤ST5,其中基于给定奖励执行强化学习。包括,该规则包含关于源活动、稍后要执行的路由目的地活动、指示在路由源活动之后以及在奖励步骤中要执行的路由目的地活动的重要性的厚度的信息。当到达该状态前的动作与路由源活动匹配,且动作与路由目的地活动匹配时,奖励将基于厚度设置。[选择图]图13

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号