...
首页> 外文期刊>Transportation research >Hierarchical integrated machine learning model for predicting flight departure delays and duration in series
【24h】

Hierarchical integrated machine learning model for predicting flight departure delays and duration in series

机译:预测航班偏移延迟和持续时间串联的分层集成机器学习模型

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Flight delays may propagate through the entire aviation network and are becoming an important research topic. This paper proposes a novel hierarchical integrated machine learning model for predicting flight departure delays and duration in series rather than in parallel to avoid ambiguity in decision making. The paper analyses the proposed model using various machine learning algorithms in combination with different sampling techniques. The highly noisy, unbalanced, dispersed, and skewed historical high dimensional data provided by an international airline operating in Hong Kong was used to demonstrate the practical application of the model. The result shows that for a 4-h forecast horizon, a constructive neural network machine learning algorithm with the Synthetic Minority Over Sampling Technique-Tomek Links (SMOTETomek) sampling technique was able to achieve better average balanced recall accuracies of 65.5%, 61.5%, 59% for classifying delay status and predicting delay duration at thresholds of 60 min and 30 min, respectively. Similarly, for minority labels, the precision-recall and area under the curve showed that the proposed model achieved better results of 32.44% and 35.14% compared to the parallel model of 26.43% and 21.02% for thresholds of 60 min and 30 min, respectively. The effect of different sampling techniques, sampling approaches, and estimation mechanisms on prediction performance is also studied.
机译:飞行延误可以通过整个航空网络传播,正在成为一个重要的研究主题。本文提出了一种新的等级集成机器学习模型,用于预测飞行偏离延迟和持续时间,而不是并行,以避免决策中的模糊性。本文使用各种机器学习算法与不同的采样技术结合使用各种机器学习算法来分析所提出的模型。在香港运营的国际航空公司提供的高度嘈杂,不平衡,分散和偏斜的历史高维数据,用于证明该模型的实际应用。结果表明,对于4-H预测地平线,具有合成少数群体对采样技术的建设性神经网络机学习算法 - Tomek Links(SmoTetomek)采样技术能够实现65.5%,61.5%的更好的平均平衡召回精度。 59%的分类延迟状态并分别以60分钟和30分钟的阈值预测延迟持续时间。类似地,对于少数民族标签,曲线下的精密召回和面积显示,与60分钟和30分钟的阈值分别为26.43%和21.02%的平行模型,所提出的模型达到32.44%和35.14%的结果。还研究了不同采样技术,采样方法和估计机制对预测性能的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号