基于矢量量化的强化学习及其在机器人行为学习中的应用

段勇; 伊婧; 张永赫; 徐心和

首页> 中文期刊> 《高技术通讯》 >基于矢量量化的强化学习及其在机器人行为学习中的应用

基于矢量量化的强化学习及其在机器人行为学习中的应用

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

针对强化学习(RL)中状态空间过大所引起的学习时间过长或算法难于收敛等问题,提出了一种基于矢量量化(VQ)技术的表格型强化学习方法--VQRL方法,该方法用矢量量化器的码书矢量来逼近强化学习的状态空间,从而有效地解决了强化学习的状态空间分割问题,并提高了学习的收敛速度.同时根据等失真理论将一种失真敏感自组织特征映射(SOFM)神经网络用于矢量量化,以达到更好的强化学习状态空间泛化性能.将此方法应用于反应式移动机器人的行为学习的实验验证了此方法的有效性,实验表明,此方法能够较好地解决复杂未知环境的机器人导航问题.%Considering that in the course of reinforcement learning (RL), the too large state space causes the problems of long time leaming and difficulty in the learning algorithm's convergence, the paper proposes the VQRL method, a LookupTable reinforcement learning method based on vector quantization (VQ). The proposed method utilizes the codebook of vector quantization to approximate the continuous state space of reinforcement leaming, which solves the partition state space problem of RL and improves the speed of convergence effectively. And based on the equal distortion theory, it uses a distortion sensitive self-organizing feature map (SOFM) to quantize vectors. Therefore, the favorable generalization performance of state space can be obtained. The proposed method was used for learning the behavior of a reactive robot. The experiments showed the effectiveness of the presented algorithm. It can effectively solve the navigation problems under complicated unknown environments.

著录项

来源
《高技术通讯》 |2011年第2期|179-184|共6页
作者
段勇; 伊婧; 张永赫; 徐心和;
展开▼
作者单位

沈阳工业大学信息科学与工程学院,沈阳,110870;

沈阳工业大学信息科学与工程学院,沈阳,110870;

沈阳工业大学信息科学与工程学院,沈阳,110870;

东北大学信息科学与工程学院,沈阳,110819;

展开▼
原文格式 PDF
正文语种 chi
中图分类
关键词
强化学习(RL); 矢量量化(VQ); 码书; Q(λ)学习; 自组织特征映射;

相似文献

中文文献
外文文献
专利

1. 基于二型模糊系统的强化学习及其在机器人行为学习中的应用 [J] . 段勇 ,伊婧 . 制造业自动化 . 2011,第022期
2. 模糊强化学习型的图像矢量量化算法 [J] . 姜来 ,许文焕 ,纪震 . 电子学报 . 2006,第009期
3. 基于强化学习的多机器人合作行为获取 [J] . 李冬梅 ,陈卫东 ,席裕庚 . 上海交通大学学报 . 2005,第8期
4. 基于模糊小波网络的强化学习及其在多机器人决策策略中的应用 [J] . 段勇 ,李程 ,徐心和 . 高技术通讯 . 2013,第004期
5. 基于强化学习的进化神经网络及其在机器人导航中的应用 [J] . 李佳鹤 ,姚明海 . 浙江工业大学学报 . 2010,第006期
6. 基于自适应量化的智能机器人强化学习方法研究 [C] . 张汝波 ,王醒策 ,杨广铭 . 第三届全球智能控制与自动化大会 . 2000
7. 强化学习及其在自主机器人行为学习中的应用 [A] . 杨丽 . 2002

基于矢量量化的强化学习及其在机器人行为学习中的应用

摘要

著录项

相似文献

相关主题

期刊订阅