Towards deep symbolic reinforcement learning

机译：走向深刻的象征性强化学习

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep reinforcement learning (DRL) brings the power of deep neural networks to bear on the generic task of trial-and-error learning, and its effectiveness has been convincingly demonstrated on tasks such as Atari video games and the game of Go. However, contemporary DRL systems inherit a number of shortcomings from the current generation of deep learning techniques. For example, they require very large datasets to work effectively, entailing that they are slow to learn even when such datasets are available. Moreover, they lack the ability to reason on an abstract level, which makes it difficult to implement high-level cognitive functions such as transfer learning, analogical reasoning, and hypothesis-based reasoning. Finally, their operation is largely opaque to humans, rendering them unsuitable for domains in which verifiability is important. In this paper, we propose an end-to-end reinforcement learning architecture comprising a neural back end and a symbolic front end with the potential to overcome each of these shortcomings. As proof-of-concept, we present a preliminary implementation of the architecture and apply it to several variants of a simple video game. We show that the resulting system -- though just a prototype -- learns effectively, and, by acquiring a set of symbolic rules that are easily comprehensible to humans, dramatically outperforms a conventional, fully neural DRL system on a stochastic variant of the game.

机译：深度强化学习（DRL）带来了深度神经网络的力量来承担试错学习的一般任务，并且在Atari视频游戏和Go游戏等任务上令人信服地证明了其有效性。但是，当代的DRL系统从当前的深度学习技术中继承了许多缺点。例如，他们需要非常大的数据集才能有效工作，这意味着即使有这样的数据集，学习起来也很慢。此外，他们缺乏抽象层次上的推理能力，这使得难以实现高级认知功能，例如迁移学习，类比推理和基于假设的推理。最后，它们的操作在很大程度上对人类是不透明的，从而使其不适用于可验证性很重要的领域。在本文中，我们提出了一种端到端的强化学习体系结构，该体系结构包含神经后端和符号前端，它们有可能克服这些缺点中的每一个。作为概念验证，我们介绍了该体系结构的初步实现，并将其应用于简单视频游戏的多种变体。我们证明了最终的系统-尽管只是一个原型-可以有效学习，并且通过获取一组易于人类理解的符号规则，在游戏的随机变体上大大优于传统的全神经DRL系统。

著录项

作者
Garnelo M; Arulkumaran K; Shanahan M;
展开▼
作者单位

展开▼
年度 2016
总页数
原文格式 PDF
正文语种
中图分类

相似文献

外文文献
中文文献
专利

1. Learning to balance an NAO robot using reinforcement learning with symbolic inverse kinematic [J] . Tutsoy Onder, Barkana Duygun Erol, Colak Sule Transactions of the Institute of Measurement and Control . 2017,第11期

机译：使用符号逆运动学使用钢筋学习来平衡Nao机器人
2. Advanced planning for autonomous vehicles using reinforcement learning and deep inverse reinforcement learning [J] . You Changxi, Lu Jianbo, Filev Dimitar, Robotics and Autonomous Systems . 2019,第期

机译：利用强化学习和深度逆钢筋学习的自治车辆先进规划
3. Characterization of Symbolic Rules Embedded in Deep DIMLP Networks: A Challenge to Transparency of Deep Learning [J] . Guido Bologna, Yoichi Hayashi Journal of Artificial Intelligence and Soft Computing Research . 2017,第4期

机译：深度DIMLP网络中嵌入的符号规则的表征：对深度学习透明度的挑战
4. SDRL: Interpretable and Data-Efficient Deep Reinforcement Learning Leveraging Symbolic Planning [C] . Daoming Lyu, Fangkai Yang, Bo Liu, AAAI Conference on Artificial Intelligence . 2019

机译：SDRL：解释和数据有效的深度加强学习，利用象征性规划
5. On Deep Reinforcement Learning for Games: Generalization of Deep Q-Learning with Multiple Policy Heads [D] . Boucher, Mathieu. 2020

机译：关于游戏的深度加固学习：多重政策头部深度Q学的泛化
6. Learning for a Robot: Deep Reinforcement Learning Imitation Learning Transfer Learning [O] . Jiang Hua, Liangcai Zeng, Gongfa Li, 2021

机译：学习机器人：深增强学习仿制学习转移学习
7. A Comparison of Deep Reinforcement Learning and Deep learning for Complex Image Analysis [O] . Rishi Khajuria, Abdul Quyoom, Abid Sarwar 2020

机译：复杂图像分析的深度增强学习与深度学习的比较

Towards deep symbolic reinforcement learning

摘要

著录项

相似文献

相关主题

期刊订阅