首页> 外文会议>International Conference on Autonomous Agents and Multiagent Systems >Trial without Error: Towards Safe Reinforcement Learning via Human Intervention
【24h】

Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

机译:没有错误的试验:通过人类干预攻击安全的强化学习

获取原文

摘要

During training, model-free reinforcement learning (RL) systems can explore actions that lead to harmful or costly consequences. Having a human "in the loop" and ready to intervene at all times can prevent these mistakes, but is prohibitively expensive for current algorithms. We explore how human oversight can be combined with a supervised learning system to prevent catastrophic events during training. We demonstrate this scheme on Atari games, with a Deep RL agent being overseen by a human for four hours. When the class of catastrophes is simple, we are able to prevent all catastrophes without affecting the agent's learning (whereas an RL baseline fails due to catastrophic forgetting).
机译:在培训期间,无模型加强学习(RL)系统可以探索导致有害或昂贵后果的行动。在人类的“循环中”并准备始终干预,可以防止这些错误,但对于当前算法来说是对昂贵的。我们探讨人类监督如何与监督学习系统相结合,以防止在培训期间防止灾难性事件。我们展示了Atari Games上的这个计划,一个深入的RL代理由人类监督四个小时。当灾难类很简单时,我们能够在不影响代理人的学习的情况下防止所有灾难(而RL基线因灾难性遗忘而失败)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号