A hybrid adaptive heuristic critic architecture for learning in large static search spaces

机译：用于在大型静态搜索空间中学习的混合自适应启发式批评家体系结构

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We present a hybrid Adaptive Heuristic Critic (AHC) architecture which learns an internal model of a maze environment through interaction with it. The adaptive critic's model is based around a radial basis function (RBF) neural network. Over successive trials the V-function is learned, a mapping between positions in the maze and their value. The model is based upon continuous valued spacial inputs and possesses the useful feature of "local generalisation" about the value associated with the region surrounding a position in the maze. An action policy allowing straight line movements to anywhere in the maze in a single step is adopted. This policy is implemented using a genetic algorithm (GA) which searches for an optimum movement at each time step. Although for computational convenience the GA is still based upon a discretized search of the maze-space the architecture should generalise well to evolutionary algorithms more suited to searching continuous spaces, allowing the concept of a discrete state to be dispensed with altogether.

机译：我们提出了一种混合自适应启发式批评（AHC）架构，该架构通过与迷宫环境的交互来学习迷宫环境的内部模型。自适应评论家模型基于径向基函数（RBF）神经网络。在连续的试验中，学习了V函数，即迷宫中的位置与其值之间的映射。该模型基于连续的有值空间输入，并且具有关于与迷宫中某个位置周围的区域相关联的值的“局部泛化”的有用功能。采取了允许在单个步骤中将直线直线移动到迷宫中任何地方的动作策略。使用遗传算法（GA）实施该策略，该遗传算法在每个时间步长搜索最佳运动。尽管为了便于计算，GA仍基于迷宫空间的离散搜索，但该体系结构应很好地推广到更适合于搜索连续空间的进化算法，从而可以完全省去离散状态的概念。

著录项

来源
《Intelligent Control, 1994., Proceedings of the 1994 IEEE International Symposium on》|1994年|P.237-242|共6页
会议地点
作者
Pipe; A.G.; Jin; Y.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. A hybrid adaptive large neighborhood search heuristic for the team orienteering problem [J] . Hammami Farouk, Rekik Monia, Coelho Leandro C. Computers & operations research . 2020,第Nova期

机译：一个混合自适应大街区搜索启发式，队伍定向问题
2. A HYBRID GREEDY RANDOMIZED ADAPTIVE SEARCH HEURISTIC TO SOLVE THE DIAL-A-RIDE PROBLEM [J] . FRANCESCA GUERRIERO, MARIA ELENA BRUNI, FRANCESCA GRECO Asia-Pacific Journal of Operational Research . 2013,第1期

机译：混合贪婪随机搜索的启发式解决“搭便车”问题
3. A hybrid adaptive large neighborhood search heuristic for lot-sizing with setup times [J] . Muller L.F., Spoorendonk S., Pisinger D. European Journal of Operational Research . 2012,第3期

机译：混合自适应大邻域搜索启发式算法，用于按设置时间进行批量
4. A hybrid adaptive heuristic critic architecture for learning inlarge static search spaces [C] . Pipe A.G., Jin Y., Winfield A. Intelligent Control, 1994., Proceedings of the 1994 IEEE International Symposium on . -1

机译：混合自适应启发式批评家架构，用于学习大型静态搜索空间
5. Adaptive critic designs based neurocontrollers for local and wide area control of a multimachine power system with a static compensator [D] . Mohagheghi, Salman 2007

机译：基于自适应批评家设计的神经控制器，用于带有静态补偿器的多机电源系统的局部和广域控制
6. A Hybrid Color Space for Skin Detection Using Genetic Algorithm Heuristic Search and Principal Component Analysis Technique [O] . Mahdi Maktabdar Oghaz, Mohd Aizaini Maarof, Anazida Zainal, -1

机译：基于遗传算法启发式搜索和主成分分析技术的肤色混合检测空间
7. Speeding-Up Adaptive Heuristic Critic Learning with FPGA-Based Unsupervised Clustering [O] . Andrés Pérez-Uribe, Eduardo Sanchez 1997

机译：利用基于FPGA的无监督聚类加速自适应启发式批判学习

A hybrid adaptive heuristic critic architecture for learning in large static search spaces

摘要

著录项

相似文献

相关主题

期刊订阅