World Congress on Intelligent Control and Automation

An Improved Q-learning Algorithm Based on Exploration Region Expansion Strategy



Abstract

In order to strike a good balance between exploration and exploitation, one of the key problems in the Q-learning algorithm, an improved Q-learning algorithm based on an exploration region expansion strategy is proposed, building on Metropolis criterion-based Q-learning. With this strategy, blind exploration of the entire environment is eliminated and learning efficiency is increased. Meanwhile, an alternative feasible path is sought wherever the agent encounters an obstacle, which makes the algorithm easy to implement on a real robot. An automatic termination condition is also put forward, so that redundant learning after the optimal path has been found is avoided and learning time is reduced. The validity of the algorithm is demonstrated by simulation experiments.
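The Metropolis criterion the abstract builds on can be sketched as an action-selection rule: the agent prefers the greedy action but accepts an exploratory one with a probability that decays exponentially in the Q-value gap and a temperature T. The following is a minimal, hypothetical sketch of that criterion only (the function name `metropolis_action` and the dictionary-based Q-table are assumptions; the paper's exploration region expansion strategy and termination condition are not reproduced here):

```python
import math
import random

def metropolis_action(Q, state, actions, T):
    """Metropolis-criterion action selection (illustrative sketch).

    Pick the greedy action for `state`; also draw a random candidate
    action and accept it with probability exp(-(Q_greedy - Q_cand) / T).
    High T favors exploration, low T favors exploitation, so annealing
    T over episodes shifts the agent from exploring to exploiting.
    """
    greedy = max(actions, key=lambda a: Q[(state, a)])
    candidate = random.choice(actions)
    dq = Q[(state, greedy)] - Q[(state, candidate)]
    # A candidate no worse than the greedy action is always accepted;
    # a worse one is accepted with Metropolis probability exp(-dq / T).
    if dq <= 0 or random.random() < math.exp(-dq / T):
        return candidate
    return greedy
```

In a learning loop, T would typically be annealed (e.g. multiplied by a decay factor each episode), so early episodes explore broadly and later ones follow the learned policy almost greedily.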

