首页> 外文会议>Evolutionary Computation, 2005. The 2005 IEEE Congress on >AgentP classifier system: self-adjusting vs. gradual approach
【24h】

AgentP classifier system: self-adjusting vs. gradual approach

机译:AgentP分类器系统:自我调整与渐进方法

获取原文
获取外文期刊封面目录资料

摘要

Learning classifier systems belong to the class of algorithms based on the principle of self-organization and evolution and have frequently been applied to mazes, an important type of reinforcement learning problem. Mazes may contain aliasing cells, i.e. squares in a different location that look identical to an agent with limited perceptive power. Mazes with aliasing squares present a particular difficult learning problem. As a possible approach to the problem, AgentP, a learning classifier system with associative perception, was recently introduced. AgentP is based on the psychological model of associative perception learning and operates explicitly imprinted images of the environment states. Two types of learning mode are described: the first, self-adjusting AgentP, is more flexible and adapts rapidly to changing information; the second, gradual AgentP, is more conservative in drawing conclusions and rigid when it comes to revising strategy. The performance of both systems is tested on existing and new aliasing environments. The results show that AgentP often outperforms (and always at least matches) the performance of other techniques and, on the large majority of mazes used, learns optimal or near optimal solutions with fewer trials and a smaller classifier population.
机译:学习分类器系统属于基于自组织和进化原理的算法类别,并且已被广泛应用于迷宫游戏中,迷宫游戏是强化学习问题的一种重要类型。迷宫可能包含混叠单元,即在不同位置的正方形,看起来与感知能力有限的主体相同。带有锯齿方块的迷宫提出了一个特殊的学习难题。作为解决该问题的一种可能方法,最近引入了AgentP,一种具有联想感知的学习分类器系统。 AgentP基于联想感知学习的心理模型,并操作环境状态的显式印迹图像。描述了两种学习模式:第一种是自我调整的AgentP,它更灵活并且可以迅速适应不断变化的信息。第二个是渐进的AgentP,在得出结论时较为保守,在修改策略时则比较僵化。两种系统的性能都在现有和新的别名环境上进行了测试。结果表明,AgentP的性能通常优于(并且始终至少与之匹配)其他技术的性能,并且在使用的大多数迷宫上,通过较少的试验和较小的分类器种群即可学习最佳或接近最佳的解决方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号