首页> 外文会议>Fuzzy Systems, 1997., Proceedings of the Sixth IEEE International Conference on >Fuzzy interpolation-based Q-learning with profit sharing plan scheme
【24h】

Fuzzy interpolation-based Q-learning with profit sharing plan scheme

机译:基于模糊插值的Q学习与收益分享计划方案

获取原文

摘要

We have previously (1996) proposed fuzzy interpolation-based Q-learning where fuzzy rules are used to represent Q-function (action utility function), in order to enable us to treat continuous-valued states and actions. In this paper, we will introduce the idea of profit sharing plan (PSP) used in classifier systems into the fuzzy interpolation-based Q-learning in order to accelerate the speed of learning and will discuss its effectiveness through applications to control problems such as cart-pole balancing problems.
机译:我们以前(1996)所提出的基于模糊插值的Q-Learning,其中模糊规则用于代表Q函数(动作实用程序函数),以便使我们能够治疗连续值的状态和动作。在本文中,我们将介绍分类系统中使用的利润共享计划(PSP)的想法,进入基于模糊插值的Q学习,以加速学习速度,并将通过应用程序讨论其效力来控制购物车等问题-pole平衡问题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号