首页> 外文会议>Federated Conference on Computer Science and Information Systems >Adaptive Supervisor: Method of Reinforcement Learning Fault Elimination by Application of Supervised Learning
【24h】

Adaptive Supervisor: Method of Reinforcement Learning Fault Elimination by Application of Supervised Learning

机译:自适应主管:监督学习应用加强学习故障消除方法

获取原文

摘要

Reinforcement Learning (RL) is a popular approach for solving increasing number of problems. However, standard RL approach has many deficiencies. In this paper multiple approaches for addressing those deficiencies by incorporating Supervised Learning are discussed and a new approach, Reinforcement Learning with Adaptive Supervisor, is proposed. In this model, actions chosen by the RL method are rated by the supervisor and may be replaced with safer ones. The supervisor observes the results of each action and on that basis it learns the knowledge about safety of actions in various states. It helps to overcome one of the Reinforcement Learning deficiencies - risk of wrong action execution. The new approach is designed for domains, where failures are very expensive. The architecture was evaluated on a car intersection model. The proposed method eliminated around 50% of failures.
机译:强化学习(RL)是一种求解越来越多的问题的流行方法。但是,标准的RL方法有许多缺陷。在本文中,讨论了通过纳入受监管学习来解决这些缺陷的多种方法,并提出了一种新的方法,利用自适应主管加强学习。在该模型中,R1方法选择的操作由主管评定,并且可以用更安全的方式替换。主管观察每个行动的结果,并在此基础上,它学会了各种国家行动安全的知识。它有助于克服其中一个加强学习缺陷 - 错误行动执行的风险。新方法是为域设计的,故障非常昂贵。在汽车交叉点模型中评估了架构。所提出的方法消除了约50%的故障。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号