首页> 外文会议>International MultiConference of Engineers and Computer Scientists >A New Approach for Value Function Approximation Based on Automatic State Partition
【24h】

A New Approach for Value Function Approximation Based on Automatic State Partition

机译:基于自动状态分区的价值函数近似的一种新方法

获取原文

摘要

Value function is usually used to deal with the reinforcement learning problems. In large or even continuous states, function approximation must be used to represent value function. Much of the current work carried out, however, has to design the structure of function approximation in advanced which cannot be adjusted during learning. In this pa- per, we propose a novel function approximation called Fuzzy CMAC (FCMAC) with automatic state partition (ASP-FCMAC) to automate the structure design for FCMAC. Based on CMAC (also known as tile coding), ASP-FCMAC employs fuzzy membership function to avoid the setting of parameter in CMAC, and makes use of Bellman error to partition the state automatically so as to generate the structure of FC- MAC. Empirical results in both mountain car and RoboCup Keepaway domains demonstrate that ASP- FCMAC can automatically generate the structure of FCMAC and agent using it can learn efficiently.
机译:价值函数通常用于处理强化学习问题。在大甚至连续状态下,必须使用函数近似来表示值函数。然而,在进行的许多当前工作必须设计在学习期间无法调整的先进功能近似的结构。在此PA-PER中,我们提出了一种具有自动状态分区(ASP-FCMAC)的模糊CMAC(FCMAC)的新功能近似,以自动化FCMAC的结构设计。基于CMAC(也称为瓷砖编码),ASP-FCMAC采用模糊隶属函数来避免在CMAC中设置参数,并利用Bellman错误自动分区状态,以便生成FC-Mac的结构。山地汽车和Robocup展开域的经验结果表明,ASP-FCMAC可以自动产生FCMAC和代理的结构,它可以有效地学习。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号