首页> 外文OA文献 >Towards deep symbolic reinforcement learning
【2h】

Towards deep symbolic reinforcement learning

机译:走向深刻的象征性强化学习

摘要

Deep reinforcement learning (DRL) brings the power of deep neural networks to bear on the generic task of trial-and-error learning, and its effectiveness has been convincingly demonstrated on tasks such as Atari video games and the game of Go. However, contemporary DRL systems inherit a number of shortcomings from the current generation of deep learning techniques. For example, they require very large datasets to work effectively, entailing that they are slow to learn even when such datasets are available. Moreover, they lack the ability to reason on an abstract level, which makes it difficult to implement high-level cognitive functions such as transfer learning, analogical reasoning, and hypothesis-based reasoning. Finally, their operation is largely opaque to humans, rendering them unsuitable for domains in which verifiability is important. In this paper, we propose an end-to-end reinforcement learning architecture comprising a neural back end and a symbolic front end with the potential to overcome each of these shortcomings. As proof-of-concept, we present a preliminary implementation of the architecture and apply it to several variants of a simple video game. We show that the resulting system -- though just a prototype -- learns effectively, and, by acquiring a set of symbolic rules that are easily comprehensible to humans, dramatically outperforms a conventional, fully neural DRL system on a stochastic variant of the game.
机译:深度强化学习(DRL)带来了深度神经网络的力量来承担试错学习的一般任务,并且在Atari视频游戏和Go游戏等任务上令人信服地证明了其有效性。但是,当代的DRL系统从当前的深度学习技术中继承了许多缺点。例如,他们需要非常大的数据集才能有效工作,这意味着即使有这样的数据集,学习起来也很慢。此外,他们缺乏抽象层次上的推理能力,这使得难以实现高级认知功能,例如迁移学习,类比推理和基于假设的推理。最后,它们的操作在很大程度上对人类是不透明的,从而使其不适用于可验证性很重要的领域。在本文中,我们提出了一种端到端的强化学习体系结构,该体系结构包含神经后端和符号前端,它们有可能克服这些缺点中的每一个。作为概念验证,我们介绍了该体系结构的初步实现,并将其应用于简单视频游戏的多种变体。我们证明了最终的系统-尽管只是一个原型-可以有效学习,并且通过获取一组易于人类理解的符号规则,在游戏的随机变体上大大优于传统的全神经DRL系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号