Adaptive Supervisor: Method of Reinforcement Learning Fault Elimination by Application of Supervised Learning

机译：自适应主管：监督学习应用加强学习故障消除方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Reinforcement Learning (RL) is a popular approach for solving increasing number of problems. However, standard RL approach has many deficiencies. In this paper multiple approaches for addressing those deficiencies by incorporating Supervised Learning are discussed and a new approach, Reinforcement Learning with Adaptive Supervisor, is proposed. In this model, actions chosen by the RL method are rated by the supervisor and may be replaced with safer ones. The supervisor observes the results of each action and on that basis it learns the knowledge about safety of actions in various states. It helps to overcome one of the Reinforcement Learning deficiencies - risk of wrong action execution. The new approach is designed for domains, where failures are very expensive. The architecture was evaluated on a car intersection model. The proposed method eliminated around 50% of failures.

机译：强化学习（RL）是一种求解越来越多的问题的流行方法。但是，标准的RL方法有许多缺陷。在本文中，讨论了通过纳入受监管学习来解决这些缺陷的多种方法，并提出了一种新的方法，利用自适应主管加强学习。在该模型中，R1方法选择的操作由主管评定，并且可以用更安全的方式替换。主管观察每个行动的结果，并在此基础上，它学会了各种国家行动安全的知识。它有助于克服其中一个加强学习缺陷 - 错误行动执行的风险。新方法是为域设计的，故障非常昂贵。在汽车交叉点模型中评估了架构。所提出的方法消除了约50％的故障。

著录项

来源
《Federated Conference on Computer Science and Information Systems》|2018年|544p|共5页
会议地点
作者
Mateusz Krzyszton;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. A Deep Learning Algorithm for the Max-Cut Problem Based on Pointer Network Structure with Supervised Learning and Reinforcement Learning Strategies [J] . Shenshen Gu, Yue Yang Mathematics . 2020,第2期

机译：一种深入学习算法，基于指针网络结构与监督学习和加固学习策略
2. Adaptive Service Management in Mobile Cloud Computing by Means of Supervised and Reinforcement Learning [J] . Piotr Nawrocki, Bartlomiej Sniezynski Journal of network and systems management . 2018,第1期

机译：通过监督和强化学习在移动云计算中进行自适应服务管理
3. Supervised dictionary-based transfer subspace learning and applications for fault diagnosis of sucker rod pumping systems [J] . Zhang Ao, Gao Xianwen Neurocomputing . 2019,第APRa21期

机译：基于监督词典的转移子空间学习及其在抽油杆抽油系统故障诊断中的应用
4. Adaptive Supervisor: Method of Reinforcement Learning Fault Elimination by Application of Supervised Learning [C] . Mateusz Krzyszton Federated Conference on Computer Science and Information Systems . 2018

机译：自适应主管：监督学习应用加强学习故障消除方法
5. Training a Neural Network to Construct Sentences from an Inputted Word List: A Comparison Between Supervised and Reinforcement Learning Methods [D] . Black, Samuel 2018

机译：训练神经网络以从输入的单词列表构建句子：监督学习和强化学习方法之间的比较
6. Self-Supervised Joint Learning Fault Diagnosis Method Based on Three-Channel Vibration Images [O] . Weiwei Zhang, Deji Chen, Yang Kong 2021

机译：基于三通道振动图像的自我监督联合学习故障诊断方法
7. A Comparison Of Supervised And Reinforcement Learning Methods On A Reinforcement Learning Task [O] . Vijaykumar Gullapalli 1992

机译：强化学习任务中监督学习和强化学习方法的比较
8. Drive-Reinforcement Learning: A Self-Supervised Model for Adaptive Control [R] . Morgan, J. S., Patterson, E. C., Klopf, A. H. 1990

机译：驱动强化学习：自适应控制的自监督模型

Adaptive Supervisor: Method of Reinforcement Learning Fault Elimination by Application of Supervised Learning

摘要

著录项

相似文献

相关主题

期刊订阅