首页> 外国专利> CONTROL APPARATUS, CONTROL METHOD FOR CONTROL APPARATUS, NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM, INFORMATION PROCESSING SERVER, INFORMATION PROCESSING METHOD, AND CONTROL SYSTEM FOR CONTROLLING SYSTEM USING REINFORCEMENT LEARNING

CONTROL APPARATUS, CONTROL METHOD FOR CONTROL APPARATUS, NON-TRANSITORY COMPUTER READABLE STORAGE MEDIUM, INFORMATION PROCESSING SERVER, INFORMATION PROCESSING METHOD, AND CONTROL SYSTEM FOR CONTROLLING SYSTEM USING REINFORCEMENT LEARNING

机译:控制装置,控制装置的控制方法,非暂时性计算机可读存储介质,信息处理服务器,信息处理方法以及用于控制系统的控制系统,用于使用加强学习

摘要

A control apparatus for performing predetermined control for a predetermined system using reinforcement learning detects an event in a life cycle of the predetermined system and, in response to the detection of the event, set an exploration parameter specified in accordance with the detected event as a value for adjusting a ratio of exploration in the reinforcement learning. The control apparatus executes the predetermined control using the reinforcement learning in accordance with the set exploration parameter. When a first event is detected, the control apparatus sets the exploration parameter so that makes the ratio of the exploration set during a first period after the first event is smaller than the ratio of the exploration set during a second period before the first event is detected.
机译:用于使用增强学习对预定系统执行预定控制的控制装置检测预定系统的生命周期中的事件,并且响应于对事件的检测,将根据检测到的事件的检测设置为值设置指定的探索参数调整加强学习中勘探比率。控制装置根据设置探索参数使用增强学学习执行预定控制。当检测到第一事件时,控制装置设置探索参数,使得在第一事件小于检测到第一事件之前的第二个时段期间,在第一事件小于在第二次时段期间的探索集的比率期间使得探索集的比率。 。

著录项

  • 公开/公告号US2021192344A1

    专利类型

  • 公开/公告日2021-06-24

    原文格式PDF

  • 申请/专利权人 HONDA MOTOR CO. LTD.;

    申请/专利号US202017106458

  • 发明设计人 GAKUYO FUJIMOTO;

    申请日2020-11-30

  • 分类号G06N3/08;G07C5;G05B19/418;

  • 国家 US

  • 入库时间 2022-08-24 19:31:24

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号