首页> 外文会议>Intelligent Control, 1994., Proceedings of the 1994 IEEE International Symposium on >A hybrid adaptive heuristic critic architecture for learning in large static search spaces
【24h】

A hybrid adaptive heuristic critic architecture for learning in large static search spaces

机译:用于在大型静态​​搜索空间中学习的混合自适应启发式批评家体系结构

获取原文

摘要

We present a hybrid Adaptive Heuristic Critic (AHC) architecture which learns an internal model of a maze environment through interaction with it. The adaptive critic's model is based around a radial basis function (RBF) neural network. Over successive trials the V-function is learned, a mapping between positions in the maze and their value. The model is based upon continuous valued spacial inputs and possesses the useful feature of "local generalisation" about the value associated with the region surrounding a position in the maze. An action policy allowing straight line movements to anywhere in the maze in a single step is adopted. This policy is implemented using a genetic algorithm (GA) which searches for an optimum movement at each time step. Although for computational convenience the GA is still based upon a discretized search of the maze-space the architecture should generalise well to evolutionary algorithms more suited to searching continuous spaces, allowing the concept of a discrete state to be dispensed with altogether.
机译:我们提出了一种混合自适应启发式批评(AHC)架构,该架构通过与迷宫环境的交互来学习迷宫环境的内部模型。自适应评论家模型基于径向基函数(RBF)神经网络。在连续的试验中,学习了V函数,即迷宫中的位置与其值之间的映射。该模型基于连续的有值空间输入,并且具有关于与迷宫中某个位置周围的区域相关联的值的“局部泛化”的有用功能。采取了允许在单个步骤中将直线直线移动到迷宫中任何地方的动作策略。使用遗传算法(GA)实施该策略,该遗传算法在每个时间步长搜索最佳运动。尽管为了便于计算,GA仍基于迷宫空间的离散搜索,但该体系结构应很好地推广到更适合于搜索连续空间的进化算法,从而可以完全省去离散状态的概念。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号