Fuzzy interpolation-based Q-learning with profit sharing plan scheme

机译：基于模糊插值的Q学习与收益分享计划方案

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We have previously (1996) proposed fuzzy interpolation-based Q-learning where fuzzy rules are used to represent Q-function (action utility function), in order to enable us to treat continuous-valued states and actions. In this paper, we will introduce the idea of profit sharing plan (PSP) used in classifier systems into the fuzzy interpolation-based Q-learning in order to accelerate the speed of learning and will discuss its effectiveness through applications to control problems such as cart-pole balancing problems.

机译：我们以前（1996）所提出的基于模糊插值的Q-Learning，其中模糊规则用于代表Q函数（动作实用程序函数），以便使我们能够治疗连续值的状态和动作。在本文中，我们将介绍分类系统中使用的利润共享计划（PSP）的想法，进入基于模糊插值的Q学习，以加速学习速度，并将通过应用程序讨论其效力来控制购物车等问题-pole平衡问题。

著录项

来源
《Fuzzy Systems, 1997., Proceedings of the Sixth IEEE International Conference on》|1997年|P.1707-1712|共6页
会议地点
作者
Horiuchi; T.; Fujino; A.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Fuzzy interpolation-based Q-learning with continuous inputs and outputs [J] . Tadashi Horiuchi, Akinori Fujino, Osamu Katai, 計測自動制御学会論文集 . 1999,第2期

机译：具有连续输入和输出的基于模糊插值的Q学习
2. Fuzzy interpolation-based Q-learning with continuous inputs and outputs [J] . Tadashi Horiuchi, Akinori Fujino, Osamu Katai, 計測自動制御学会論文集 . 1999,第2期

机译：基于模糊插值的Q-Learning，具有连续输入和输出
3. Multi-item fuzzy-stochastic supply chain models for long-term contracts with a profit sharing scheme [J] . Anirban Saha, Samarjit Kar, Manoranjan Maiti Applied Mathematical Modelling . 2015,第10a11期

机译：具有利润分配方案的长期合同的多项目模糊随机供应链模型
4. Fuzzy interpolation-based Q-learning with profit sharing plan scheme [C] . Horiuchi T., Fujino A., Institute of Electric and Electronic Engineer IEEE International Conference on Fuzzy Systems . 1997

机译：基于模糊插值的Q-Learning，利润共享计划计划
5. Opportunistic Routing Schemes for Large-scale and Heterogeneous Multi-hop Wireless Networks Using Directed Energy Links and Fuzzy Logic Q-Learning [D] . ?Alshehri, Ali M. 2020

机译：使用定向能量链接和模糊逻辑Q学习的大型和异构多跳无线网络的机会路由方案
6. Composite Interpolation-Based Multiscale Fuzzy Entropy and Its Application to Fault Diagnosis of Rolling Bearing [O] . Qingyun Liu, Haiyang Pan, Jinde Zheng, 2019

机译：基于复合插值的多尺度模糊熵及其在滚动轴承故障诊断中的应用
7. Rule-base reduction in Fuzzy Rule Interpolation-based Q-learning [O] . Vincze Dávid, Kovács Szilveszter 2015

机译：基于模糊规则插值的Q学习中的规则库约简

Fuzzy interpolation-based Q-learning with profit sharing plan scheme

摘要

著录项

相似文献

相关主题

期刊订阅