Reinforcement learning with partitioning function system

LI Wei; YE Qing-tai; ZHU Chang-ming

Abstract

The size of the state space is the limiting factor in applying reinforcement learning algorithms to practical problems. A reinforcement learning system with partitioning function (RLWPF) is established, in which the state space is partitioned into several regions. The operating principle of RLWPF is based on a semi-Markov decision process and is general: it can be applied to any reinforcement learning problem with a large state space. In RLWPF, the partitioning module dispatches agents to different regions in order to reduce the state space each agent must handle. The article proves the convergence of the SARSA algorithm for a semi-Markov decision process, and establishes the convergence of RLWPF by analyzing the equivalence of the value functions of the two semi-Markov decision processes before and after partitioning. It also shows that the optimal policy learned by RLWPF is consistent with prior domain knowledge. As an application, an elevator group control system is devised to decrease the average waiting time of passengers, with four agents controlling four elevator cars, one each. Based on RLWPF, a partitioning module is developed that uses a uniform round trip time as the partitioning criterion, so that the waiting time of most passengers is roughly equal; each elevator car then answers hall calls only in its own region. Compared with ordinary elevator systems and with reinforcement learning systems without a partitioning module, the performance results show the advantage of RLWPF.
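To make the SARSA step for a semi-Markov decision process concrete, here is a minimal sketch of a tabular update. It is not taken from the article. The continuous-time discount `exp(-beta * tau)`, the function names, the state and action labels, and the parameter values are all assumptions chosen for illustration. The only difference from ordinary SARSA is that the discount depends on the time `tau` the transition took.

```python
import math
import random
from collections import defaultdict


def smdp_sarsa_update(q, state, action, reward, tau, next_state, next_action,
                      alpha=0.1, beta=0.01):
    """Apply one tabular SARSA update for a transition lasting `tau` time units.

    Assumption: the reward passed in is already the (discounted) reward
    accumulated over the transition, and discounting is continuous in time.
    """
    discount = math.exp(-beta * tau)  # longer transitions are discounted more
    target = reward + discount * q[(next_state, next_action)]
    q[(state, action)] += alpha * (target - q[(state, action)])
    return q[(state, action)]


def epsilon_greedy(q, state, actions, epsilon=0.1, rng=random):
    """Pick a random action with probability epsilon, else the greedy one."""
    if rng.random() < epsilon:
        return rng.choice(actions)
    return max(actions, key=lambda a: q[(state, a)])


# Hypothetical usage: one Q-table per agent, covering only that agent's region.
q = defaultdict(float)
q[("floor_3", "stop")] = 2.0
new_value = smdp_sarsa_update(
    q, "floor_1", "up", reward=-1.0, tau=5.0,
    next_state="floor_3", next_action="stop",
)
```

Under a partitioning scheme like the one the abstract describes, each agent would keep its own table of this kind, restricted to the states of its assigned region. That restriction is what shrinks the state space each agent has to learn over.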

Bibliographic record

Source
Journal of Harbin Institute of Technology | 2004, No. 4 | pp. 377-381 | 5 pages
Authors
LI Wei; YE Qing-tai; ZHU Chang-ming
Author affiliation
College of Machine and Dynamics Engineering, Shanghai Jiaotong University, Shanghai 200030, China
Original format: PDF
Text language: English
Chinese Library Classification: General industrial technology
Keywords
multi-agent systems; partitioning; reinforcement learning; elevator
