An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand

Ying Cheng-shuo; Chow Andy H. F.; Chin Kwai-Sang

首页> 外文期刊>Transportation Research Part B: Methodological >An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand

【24h】

An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand

机译：随机需求下滚动股票循环的地铁列车调节探测深度加强学习方法

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents a novel actor-critic deep reinforcement learning approach for metro train scheduling with circulation of limited rolling stock. The scheduling problem is modeled as a Markov decision process driven by stochastic passenger demand. As in most dynamic optimization problems, the complexity of the scheduling process grows exponentially with the amount of states, decisions, and uncertainties involved. This study aims to address this 'curses of dimensionality' issue by adopting an actor-critic deep reinforcement learning solution framework. The framework simplifies the evaluation and searching process for potential optimal solutions by parameterizing the original state and decision spaces with the use of artificial neural networks. A deep deterministic policy gradient algorithm is developed for training the artificial neural networks via simulated system transitions before the actor-critic agent can be applied for online schedule control. The proposed approach is tested with a real-world scenario configured with data collected from the Victoria Line of London Underground, UK. Experiment results illustrate the advantages of the proposed method over a range of established meta-heuristics in terms of computing time, system efficiency, and robustness under different stochastic environments. This study innovates urban transit operations with state-of-the-art computer science and dynamic optimization techniques. (C) 2020 Elsevier Ltd. All rights reserved.

机译：本文提出了一种新的演员评论家，用于滚动储量流通的地铁列车调度深度增强学习方法。调度问题被建模为由随机乘客需求驱动的马尔可夫决策过程。与大多数动态优化问题一样，调度过程的复杂性与所涉及的状态，决策和不确定性的数量呈指数级增长。本研究旨在通过采用演员 - 评论家的深度加强学习解决方案框架来解决这一“维度”问题的“诅咒”问题。该框架通过使用人工神经网络参数化原始状态和决策空间来简化潜在最佳解决方案的评估和搜索过程。开发了一种深度确定性政策梯度算法，用于通过模拟系统转换训练人工神经网络，然后在演员 - 批评者代理可以应用于在线计划控制。建议的方法是用现实世界的情景，配置了从英国伦敦维多利亚线收集的数据。实验结果说明了所提出的方法在不同随机环境下的计算时间，系统效率和鲁棒性方面在一系列建立的元启发式中的优点。本研究创新了与最先进的计算机科学和动态优化技术的城市过境运营。（c）2020 elestvier有限公司保留所有权利。

著录项

来源
《Transportation Research Part B: Methodological》 |2020年第10期|210-235|共26页
作者
Ying Cheng-shuo; Chow Andy H. F.; Chin Kwai-Sang;
展开▼
作者单位

City Univ Hong Kong Syst Engn & Engn Management Kowloon Tong Hong Kong Peoples R China;

City Univ Hong Kong Architecture & Civil Engn Kowloon Tong Hong Kong Peoples R China;

City Univ Hong Kong Syst Engn & Engn Management Kowloon Tong Hong Kong Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Metro train scheduling; Stochastic transit demand; Actor-critic architecture; Deep reinforcement learning; Multi-objective optimization;

机译：地铁列车调度;随机运输需求;演员 - 评论家建筑;深增强学习;多目标优化;

相似文献

外文文献
中文文献
专利

1. Energy-Efficient Train Scheduling and Rolling Stock Circulation Planning in a Metro Line: A Linear Programming Approach [J] . Mo Pengli, Yang Lixing, DAriano Andrea, IEEE Transactions on Intelligent Transportation Systems . 2020,第9期

机译：在地铁线路中节能列车调度和滚动股票流通规划：一种线性规划方法
2. An Actor-Critic Deep Reinforcement Learning Approach for Transmission Scheduling in Cognitive Internet of Things Systems [J] . Yang Helin, Xie Xianzhong IEEE systems journal . 2020,第1期

机译：事实互联网传输调度的演员评论家深度加强学习方法
3. Passenger demand oriented train scheduling and rolling stock circulation planning for an urban rail transit line [J] . Yihui Wang, Andrea D’Ariano, Jiateng Yin, Transportation Research Part B: Methodological . 2018,第DECa期

机译：面向乘客需求的城市轨道交通线列车时刻表和机车车辆流通计划
4. Deep Reinforcement Learning Approach for Train Rescheduling Utilizing Graph Theory [C] . Mitsuaki Obara, Takehiro Kashiyama, Yoshihide Sekimoto IEEE International Conference on Big Data . 2018

机译：图论的列车深度调度深度强化学习方法
5. Mars: Multi-Scalable Actor-Critic Reinforcement Learning Scheduler [D] . Baheri, Betis. 2020

机译：火星：多可扩展的演员 - 评论家强化学习调度员
6. On-Demand Channel Bonding in Heterogeneous WLANs: A Multi-Agent Deep Reinforcement Learning Approach [O] . Hang Qi, Hao Huang, Zhiqun Hu, 2020

机译：异构WLAN中的按需信道绑定：多代理深度强化学习方法
7. An Actor-Critic Reinforcement Learning Approach to Minimum age of Information Scheduling in Energy Harvesting Networks [O] . Shiyang Leng, Aylin Yener 2021

机译：能量收集网络中信息调度最低年龄的演员批评者加强学习方法
8. Rolling Stock Circulation Model for Combining and Splitting of Passenger Trains [R] . Fioole, P. J., Kroon, L. G., Maroti, G., 2004

机译：旅客列车组合劈开的车辆循环模型

An actor-critic deep reinforcement learning approach for metro train scheduling with rolling stock circulation under stochastic demand

摘要

著录项

相似文献

相关主题

期刊订阅