A Direct Policy-Search Algorithm for Relational Reinforcement Learning

机译：一种直接策略研究的关系强化学习算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the field of relational reinforcement learning - a representational generalisation of reinforcement learning - the first-order representation of environments results in a potentially infinite number of possible states, requiring learning agents to use some form of abstraction to learn effectively. Instead of forming an abstraction over the state-action space, an alternative technique is to create behaviour directly through policy-search. The algorithm named CERRLA presented in this paper uses the cross-entropy method to learn behaviour directly in the form of decision-lists of relation rules for solving problems in a range of different environments, without the need for expert guidance in the learning process. The behaviour produced by the algorithm is easy to comprehend and is biased towards compactness. The results obtained show that CERRLA is competitive in both the standard testing environment and in Ms. PAC-MAN and CARCASSONNE, two large and complex game environments.

机译：在关系强度学习领域 - 增强学习的代表性概括 - 环境的一阶表示导致可能的无限数量的可能状态，需要学习代理使用某种形式的抽象来有效地学习。不是通过状态动作空间形成抽象，而是一种替代技术是通过策略搜索直接创建行为。本文提出的算法名为Cerrla使用跨熵方法直接以决策列表的形式学习行为，以解决一系列不同环境中的问题，而无需专家指导。算法产生的行为易于理解，并偏向紧凑。得到的结果表明，Cerrla在标准测试环境和Pac-Man和Carcassonne女士，两个大型和复杂的游戏环境中具有竞争力。

著录项

来源
《International Conference on Inductive Logic Programming》|2014年||共17页
会议地点
作者
Samuel Sarjant; Bernhard Pfahringer; Kurt Driessens; Tony Smith;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311-53;
关键词

相似文献

外文文献
中文文献
专利

1. Interplay of Rhythmic and Discrete Manipulation Movements During Development: A Policy-Search Reinforcement-Learning Robot Model [J] . Valentina Cristina Meola, Daniele Caligiore, Valerio Sperati, IEEE Transactions on Cognitive and Developmental Systems . 2016,第3期

机译：有节奏的和离散的操纵运动在开发过程中的相互作用：政策搜索强化学习机器人模型
2. Adopting Relational Reinforcement Learning in Covering Algorithms for Numeric and Noisy Environments [J] . ElGibreen Hebah, Aksoy Mehmet Sabih International journal of computational intelligence systems . 2016,第3期

机译：在数值和噪声环境下的覆盖算法中采用关系强化学习
3. Relational Reinforcement Learning: Lifting Propositional Algorithms and Size Matters [J] . Martijn van Otterlo OGAI Journal . 2008,第1期

机译：关系强化学习：提升命题算法和大小问题
4. A Direct Policy-Search Algorithm for Relational Reinforcement Learning [C] . Samuel Sarjant, Bernhard Pfahringer, Kurt Driessens, International conference on inductive logic programming . 2014

机译：关系强化学习的直接策略搜索算法
5. A learning classifier system approach to relational reinforcement learning [D] . Mellor, Drew 2008

机译：关系强化学习的学习分类器系统方法
6. Algorithmic Analysis of Relational Learning Processes in Instructional Technology: Some Implications for Basic Translational and Applied Research [O] . William J. McIlvane, Joanne B. Kledaras, Christophe J. Gerard, -1

机译：教学技术中关系学习过程的算法分析：对基础研究翻译研究和应用研究的一些启示
7. Interplay of rhythmic and discrete manipulation movements during development: a policy-search reinforcement-learning robot model [O] . Meola Valentina Cristina, Caligiore Daniele, Sperati Valerio, 2016

机译：开发过程中有节奏和离散的操纵动作之间的相互作用：策略搜索强化学习机器人模型

A Direct Policy-Search Algorithm for Relational Reinforcement Learning

摘要

著录项

相似文献

相关主题

期刊订阅