A STUDY OF ELEVATOR DYNAMIC SCHEDULING POLICY BASED ON REINFORCEMENT LEARNING

Zong Qun; Song Chao-feng; Xing Guan-sheng

Abstract

The elevator group scheduling problem is formulated as a Markov Decision Process (MDP), and the elements of the reinforcement learning model are then defined. In applying reinforcement learning, a stochastic action-selection policy handles exploration and a feed-forward neural network handles generalization of the value function. Both are integrated into the value iteration algorithm known as "Q-learning" to form the complete algorithm for elevator group scheduling. Simulation results show that the resulting scheduler learns well, performs well, and adapts to different traffic flows.
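
The combination described in the abstract (Q-learning, a stochastic action-selection policy, and a feed-forward network that approximates the value function) can be illustrated with a minimal sketch. Everything specific here is an assumption for illustration and is not taken from the paper: Boltzmann (softmax) sampling stands in for the stochastic policy, the environment is a toy with two cars whose state is each car's normalized distance to a new hall call, the reward is the negative distance of the assigned car, and the names `QNetwork`, `boltzmann_action`, `q_learning_step`, and `train_toy_dispatcher` are hypothetical.

```python
import math
import random


class QNetwork:
    """One-hidden-layer feed-forward net: state vector in, one Q-value per action out."""

    def __init__(self, n_inputs, n_hidden, n_actions, lr=0.05, seed=0):
        rng = random.Random(seed)
        self.lr = lr
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_inputs)]
                   for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)]
                   for _ in range(n_actions)]
        self.b2 = [0.0] * n_actions

    def forward(self, state):
        hidden = [math.tanh(sum(w * x for w, x in zip(row, state)) + b)
                  for row, b in zip(self.w1, self.b1)]
        q = [sum(w * h for w, h in zip(row, hidden)) + b
             for row, b in zip(self.w2, self.b2)]
        return hidden, q

    def q_values(self, state):
        return self.forward(state)[1]

    def update(self, state, action, target):
        """One gradient step on 0.5 * (target - Q(state, action))**2; returns the error."""
        hidden, q = self.forward(state)
        error = target - q[action]
        for j, h in enumerate(hidden):
            # Gradient reaching hidden unit j, computed before w2 is changed.
            grad_hidden = error * self.w2[action][j] * (1.0 - h * h)
            self.w2[action][j] += self.lr * error * h
            for i, x in enumerate(state):
                self.w1[j][i] += self.lr * grad_hidden * x
            self.b1[j] += self.lr * grad_hidden
        self.b2[action] += self.lr * error
        return error


def boltzmann_action(q, temperature, rng):
    """Sample an action with probability proportional to exp(Q / temperature)."""
    top = max(q)  # subtract the maximum for numerical stability
    weights = [math.exp((v - top) / temperature) for v in q]
    return rng.choices(range(len(q)), weights=weights)[0]


def q_learning_step(net, state, action, reward, next_state, gamma=0.9):
    """Q-learning target: reward + gamma * max over a' of Q(next_state, a')."""
    target = reward + gamma * max(net.q_values(next_state))
    return net.update(state, action, target)


def train_toy_dispatcher(steps=10000, gamma=0.5, temperature=1.0, seed=0):
    """Toy stand-in for an elevator simulator (assumed, not from the paper)."""
    rng = random.Random(seed)
    net = QNetwork(n_inputs=2, n_hidden=8, n_actions=2, seed=seed)
    state = [rng.random(), rng.random()]  # distance of car 0 and car 1 to the call
    for _ in range(steps):
        action = boltzmann_action(net.q_values(state), temperature, rng)
        reward = -state[action]  # assumed reward: negative travel distance
        next_state = [rng.random(), rng.random()]  # next hall call arrives
        q_learning_step(net, state, action, reward, next_state, gamma)
        state = next_state
    return net


if __name__ == "__main__":
    net = train_toy_dispatcher()
    # Q-values for assigning car 0 (near) versus car 1 (far) to the call.
    print(net.q_values([0.1, 0.9]))
```

In this sketch the temperature controls exploration: a high value samples actions almost uniformly, while a value near zero approaches greedy selection. A real group controller would need a far richer state (car positions, directions, registered calls) and a traffic simulator, neither of which the abstract specifies.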

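Bibliographic details

Source: Elevator World, 2006, No. 1, pp. 58-60, 62, 64 (5 pages)
Authors: Zong Qun; Song Chao-feng; Xing Guan-sheng
Affiliation: Tianjin University
Indexed in: Engineering Index (EI)
Original format: PDF
Language: English
Chinese Library Classification: Other special-purpose machinery and equipment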

