Journal: Neurocomputing

Pseudo-rehearsal: Achieving deep reinforcement learning without catastrophic forgetting



Abstract

Neural networks can achieve excellent results in a wide variety of applications. However, when they attempt to learn tasks sequentially, they tend to learn the new task while catastrophically forgetting previous ones. We propose a model that overcomes catastrophic forgetting in sequential reinforcement learning by combining ideas from continual learning in both the image classification domain and the reinforcement learning domain. This model features a dual memory system, which separates continual learning from reinforcement learning, and a pseudo-rehearsal system that "recalls" items representative of previous tasks via a deep generative network. Our model sequentially learns three Atari 2600 games without demonstrating catastrophic forgetting and continues to perform above human level on all of them. This result is achieved without demanding additional storage as the number of tasks increases, without storing raw data, and without revisiting past tasks. In comparison, previous state-of-the-art solutions are substantially more vulnerable to forgetting on these complex deep reinforcement learning tasks. (C) 2020 Elsevier B.V. All rights reserved.
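The pseudo-rehearsal idea described in the abstract can be illustrated with a toy sketch (not the authors' code): a generative model of old inputs stands in for stored data, and the old network labels the generated "recalled" items, which are then mixed into training on the new task. In this hypothetical, minimal version, a Gaussian fitted to old inputs replaces the deep generative network and a linear least-squares model replaces the deep reinforcement learner; all names and settings are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)

def train(W, X, Y, lr=0.05, steps=500):
    """Gradient descent on mean-squared error for a linear 'network'."""
    for _ in range(steps):
        W = W - lr * X.T @ (X @ W - Y) / len(X)
    return W

d, k = 5, 2
W_A_true = rng.normal(size=(d, k))  # ground-truth mapping for task A
W_B_true = rng.normal(size=(d, k))  # ground-truth mapping for task B

# Task A: inputs drawn from a shifted Gaussian.
X_A = rng.normal(loc=1.0, size=(200, d))
Y_A = X_A @ W_A_true

W = train(np.zeros((d, k)), X_A, Y_A)  # learn task A first

# Stand-in for the deep generative network: a Gaussian fitted to
# task-A inputs, so no raw task-A data needs to be stored.
mu, sigma = X_A.mean(axis=0), X_A.std(axis=0)
X_pseudo = rng.normal(loc=mu, scale=sigma, size=(200, d))
Y_pseudo = X_pseudo @ W  # the old network labels its own "recalled" items

# Task B: different input distribution, different target mapping.
X_B = rng.normal(loc=-1.0, size=(200, d))
Y_B = X_B @ W_B_true

# Sequential learning WITHOUT rehearsal: task A is forgotten.
W_forget = train(W.copy(), X_B, Y_B)

# WITH pseudo-rehearsal: pseudo-items are mixed into task-B training.
W_rehearse = train(W.copy(),
                   np.vstack([X_B, X_pseudo]),
                   np.vstack([Y_B, Y_pseudo]))

def task_a_error(W):
    return float(np.mean((X_A @ W - Y_A) ** 2))

print("task-A error, no rehearsal:    ", round(task_a_error(W_forget), 3))
print("task-A error, pseudo-rehearsal:", round(task_a_error(W_rehearse), 3))
```

The rehearsed model retains far more of task A than the naively fine-tuned one, while storing only generator parameters rather than raw data, which mirrors the storage claim in the abstract.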

