首页> 外文会议>International conference on parallel problem solving from nature >Optimality-Based Analysis of XCSF Compaction in Discrete Reinforcement Learning
【24h】

Optimality-Based Analysis of XCSF Compaction in Discrete Reinforcement Learning

机译:离散强化学习中基于最优性的XCSF压实分析

获取原文

摘要

Learning classifier systems (LCSs) are population-based predictive systems that were originally envisioned as agents to act in reinforcement learning (RL) environments. These systems can suffer from population bloat and so are amenable to compaction techniques that try to strike a balance between population size and performance. A well-studied LCS architecture is XCSF, which in the RL setting acts as a Q-function approximator. We apply XCSF to a deterministic and stochastic variant of the FrozenLake8x8 environment from OpenAI Gym, with its performance compared in terms of function approximation error and policy accuracy to the optimal Q-functions and policies produced by solving the environments via dynamic programming. We then introduce a novel compaction algorithm (Greedy Niche Mass Compaction-GNMC) and study its operation on XCSF's trained populations. Results show that given a suitable parametrisation, GNMC preserves or even slightly improves function approximation error while yielding a significant reduction in population size. Reasonable preservation of policy accuracy also occurs, and we link this metric to the commonly used steps-to-goal metric in maze-like environments, illustrating how the metrics are complementary rather than competitive.
机译:学习分类器系统(LCS)是基于人口的预测系统,最初被设想为在强化学习(RL)环境中起作用的主体。这些系统可能会遭受人口膨胀的困扰,因此适合尝试在人口规模和绩效之间寻求平衡的压实技术。一个经过充分研究的LCS架构是XCSF,它在RL设置中充当Q函数逼近器。我们将XCSF应用于OpenAI Gym的FrozenLake8x8环境的确定性和随机变体,其性能在函数逼近误差和策略准确性方面与通过动态编程解决环境所产生的最佳Q函数和策略进行了比较。然后,我们介绍一种新颖的压缩算法(Greedy Niche Mass Compaction-GNMC),并研究它在XCSF受过训练的人群上的运行情况。结果表明,给定合适的参数设置,GNMC可以保留或什至略微改善函数逼近误差,同时可以显着减少总体规模。还可以合理地保留策略的准确性,并且我们将此度量标准与类似迷宫的环境中常用的“逐步实现目标”度量标准进行了链接,说明了这些度量标准是互补的而不是竞争性的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号