Towards efficient airline disruption recovery with reinforcement learning

Ding Yida; Wandelt Sebastian; Wu Guohua; Xu Yifan; Sun Xiaoqian

Abstract

Disruptions to airline schedules precipitate flight delays and cancellations and cause significant losses for airline operations. The goal of the integrated airline recovery problem is to develop an operational tool that provides the airline with an instant and cost-effective solution concerning aircraft, crew members and passengers in the face of emerging disruptions. In this paper, we formulate a decision recommendation framework which incorporates various recovery decisions, including aircraft and crew rerouting, passenger reaccommodation, departure holding, flight cancellation and cruise speed control. Given the computational hardness of solving the mixed-integer nonlinear programming (MINP) model with a commercial solver (e.g., CPLEX), we establish a novel solution framework by incorporating Deep Reinforcement Learning (DRL) into the Variable Neighborhood Search (VNS) algorithm with well-designed neighborhood structures and a state evaluator. We utilize Proximal Policy Optimization (PPO) to train the stochastic policy used to select neighborhood operations given the current state throughout the Markov Decision Process (MDP). Experimental results show that the objective value generated by our approach is within a 1.5% gap of the optimal/close-to-optimal objective of the CPLEX solver for the small-scale instances, with significant improvement in runtime. The pre-trained DRL agent can leverage features/weights obtained from the training process to accelerate convergence of the objective and further improve solution quality, which exhibits the potential of achieving Transfer Learning (TL). Given the inherent intractability of the problem on practical-size instances, we propose a method to control the size of the DRL agent's action space to allow for an efficient training process. We believe our study contributes to the efforts of airlines in seeking efficient and cost-effective recovery solutions.
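
The idea of a learned stochastic policy choosing which neighborhood operation to apply inside a local search can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not the authors' implementation: the objective is a toy single-resource sequencing cost rather than the paper's MINP model, the three operators (swap, insert, reverse) are generic permutation moves, and a simple softmax preference update (a gradient-bandit rule) stands in for PPO. There is no state input to the policy here, so this is closer to adaptive operator selection than to a full MDP formulation.

```python
import math
import random

def cost(seq):
    # Toy objective (assumption): total weighted completion time.
    # Each item is (duration, weight).
    t, total = 0, 0
    for duration, weight in seq:
        t += duration
        total += weight * t
    return total

def swap(seq, rng):
    # Exchange two randomly chosen positions.
    i, j = rng.sample(range(len(seq)), 2)
    out = list(seq)
    out[i], out[j] = out[j], out[i]
    return out

def insert(seq, rng):
    # Remove one item and reinsert it at a random position.
    out = list(seq)
    item = out.pop(rng.randrange(len(out)))
    out.insert(rng.randrange(len(out) + 1), item)
    return out

def reverse(seq, rng):
    # Reverse a randomly chosen contiguous segment.
    i, j = sorted(rng.sample(range(len(seq)), 2))
    return seq[:i] + seq[i:j + 1][::-1] + seq[j + 1:]

OPERATORS = [swap, insert, reverse]  # hypothetical neighborhood structures

def softmax(prefs):
    m = max(prefs)
    exps = [math.exp(p - m) for p in prefs]
    s = sum(exps)
    return [e / s for e in exps]

def policy_guided_search(start, steps=2000, lr=0.5, seed=0):
    rng = random.Random(seed)
    prefs = [0.0] * len(OPERATORS)   # one preference per operator
    current = list(start)
    scale = max(cost(current), 1)    # normalizes rewards
    baseline = 0.0                   # running average reward
    for _ in range(steps):
        probs = softmax(prefs)
        k = rng.choices(range(len(OPERATORS)), weights=probs)[0]
        candidate = OPERATORS[k](current, rng)
        # Reward is the normalized cost reduction (positive if improved).
        reward = (cost(current) - cost(candidate)) / scale
        # Gradient-bandit update: raise the preference of operators
        # that beat the baseline, lower it otherwise.
        for a in range(len(prefs)):
            grad = (1.0 if a == k else 0.0) - probs[a]
            prefs[a] += lr * (reward - baseline) * grad
        baseline += 0.05 * (reward - baseline)
        if reward > 0:               # accept improving moves only
            current = candidate
    return current, softmax(prefs)

if __name__ == "__main__":
    jobs = [(3, 1), (1, 4), (2, 2), (5, 1), (4, 3)]
    best, probs = policy_guided_search(jobs)
    print(cost(jobs), cost(best), probs)
```

The sketch accepts only improving moves, so the returned sequence never costs more than the starting one. A closer analogue to the abstract would condition the operator probabilities on features of the current solution and train them with PPO.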

Bibliographic details

Source
Transportation research, Part E. Logistics and transportation review | 2023, Issue 11 | pp. 1.1-1.27 | 27 pages
Authors
Ding Yida; Wandelt Sebastian; Wu Guohua; Xu Yifan; Sun Xiaoqian
Affiliations

Beihang Univ;

Air China;

Indexed in: Science Citation Index (SCI), USA
Original format: PDF
Language of text: English
Chinese Library Classification: Integrated transportation
Keywords
Airline scheduling; Disruptions; Deep Reinforcement Learning

