During training, model-free reinforcement learning (RL) systems can explore actions that lead to harmful or costly consequences. Having a human "in the loop" and ready to intervene at all times can prevent these mistakes, but is prohibitively expensive for current RL algorithms. We explore how human oversight can be combined with a supervised learning system to prevent catastrophic events during training. We demonstrate this scheme on Atari games, with a deep RL agent overseen by a human for four hours. When the class of catastrophes is simple, we are able to prevent all catastrophes without affecting the agent's learning (whereas an RL baseline fails due to catastrophic forgetting).
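The core idea of combining a trained supervised "blocker" with the RL training loop can be sketched as follows. This is a minimal illustrative sketch, not the paper's implementation: the `Blocker` class, `safe_step` helper, and the use of a fixed set of known-catastrophic state-action pairs are all hypothetical simplifications (in practice the blocker would be a classifier trained on the human overseer's block/allow labels).

```python
class Blocker:
    """Stands in for a supervised model trained on human (state, action)
    block/allow labels. Here it simply memorizes catastrophic pairs."""
    def __init__(self, catastrophic_pairs):
        self.catastrophic_pairs = set(catastrophic_pairs)

    def should_block(self, state, action):
        return (state, action) in self.catastrophic_pairs

def safe_step(env_step, blocker, state, action, safe_action, penalty=-1.0):
    """Intercept the agent's chosen action before it reaches the environment.
    If the blocker predicts the human would veto it, substitute a safe action
    and return a penalty, so the agent learns not to attempt it."""
    if blocker.should_block(state, action):
        next_state, _, done = env_step(state, safe_action)
        return next_state, penalty, done
    return env_step(state, action)
```

The key design point is that the blocker replaces the human after an initial oversight phase, so catastrophes are prevented both during and after human supervision without pausing training.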