Anticipatory Learning Classifier Systems and Factored Reinforcement Learning

Abstract

Factored Reinforcement Learning (FRL) is a new technique for solving Factored Markov Decision Problems (FMDPs) when the structure of the problem is not known in advance. Like Anticipatory Learning Classifier Systems (ALCSs), it is a model-based Reinforcement Learning approach that includes generalization mechanisms for structured domains. In general, FRL and ALCSs are explicit, state-anticipatory approaches that learn generalized state transition models and use them to improve system behavior through model-based reinforcement learning techniques. In this contribution, we highlight the conceptual similarities and differences between FRL and ALCSs, focusing on the one hand on SPITI, an instance of an FRL method, and on the other hand on two ALCSs, MACS and XACS. Though FRL systems seem to benefit from a clearer theoretical grounding, an empirical comparison between SPITI and XACS on two benchmark problems reveals that the latter scales much better than the former when some combinations of state variables do not occur. Based on this finding, we discuss the mechanisms in XACS that result in the better scalability and propose importing these mechanisms into FRL systems.

Bibliographic record

Source
Anticipatory Behavior in Adaptive Learning Systems: From Psychological Theories to Artificial Cognitive Systems | 2008 | pp. 321-333 | 13 pages
Conference location: Munich (DE)
Authors
Olivier Sigaud; Martin V. Butz; Olga Kozlova; Christophe Meyer
Author affiliations

Universite Pierre et Marie Curie - Paris 6, Institut des Systemes Intelligents et de Robotique (ISIR), CNRS UMR 7222, 4 place Jussieu, F-75005 Paris, France

University of Wuerzburg, Roentgenring 11, 97070 Wuerzburg, Germany

Universite Pierre et Marie Curie - Paris 6, Institut des Systemes Intelligents et de Robotique (ISIR), CNRS UMR 7222, 4 place Jussieu, F-75005 Paris, France; Thales Security Solutions Services, Simulation, 1 rue du General de Gaulle, Osny BP 226, F95523 Cergy Pontoise Cedex, France

Thales Security Solutions Services, ThereSIS Research and Innovation Office, Route departementale 128, F91767 Palaiseau Cedex, France

Original format: PDF
Language: English
Chinese Library Classification: Artificial intelligence theory

