首页> 外文会议>International Joint Conference on Neural Networks >On Evaluating Data Preprocessing Methods for Machine Learning Models for Flight Delays
【24h】

On Evaluating Data Preprocessing Methods for Machine Learning Models for Flight Delays

机译:航班延误机器学习模型的数据预处理方法评估

获取原文

摘要

Flight delays cause various inconveniences for airlines, airports, and passengers. According to data provided by the Brazilian National Civil Aviation Agency (ANAC), between 2009 and 2015, about 22% of domestic flights made in Brazil were delayed by more than 15 minutes. The prediction of these delays is fundamental to mitigate their occurrence and optimize the decision-making process of an air transport system. Particularly, airlines, airports, and users may be more interested in when delays are likely to occur than the accurate prediction of the absence of delays. This paper focuses on the unbalanced distribution of the classes of delay (presence and absence) by performing an experimental evaluation of several preprocessing methods for the development of machine-learning flight delay classification models. Those models were built from a dataset that integrates national flight operations with meteorological conditions of airports. Our results indicate the models that applied the balancing techniques performed much better in predicting the occurrence of delays, getting about 60% of hits.
机译:航班延误给航空公司,机场和乘客带来各种不便。根据巴西国家民航局(ANAC)提供的数据,在2009年至2015年之间,巴西约有22%的国内航班延误了15分钟以上。这些延误的预测对于减轻延误的发生和优化航空运输系统的决策过程至关重要。尤其是,航空公司,机场和用户可能对延迟可能发生的时间更感兴趣,而不是对不存在延迟的准确预测。本文通过对开发机器学习航班延误分类模型的几种预处理方法进行实验评估,着重研究延误类别(存在和不存在)的不平衡分布。这些模型是根据将国家航班运营与机场气象条件整合在一起的数据集构建的。我们的结果表明,应用平衡技术的模型在预测延迟的发生方面表现更好,获得了约60%的点击。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号