首页> 外文会议>International Conference on Intelligent Data Engineering and Automated Learning >Imputation of Missing Boarding Stop Information in Smart Card Data with Machine Learning Methods
【24h】

Imputation of Missing Boarding Stop Information in Smart Card Data with Machine Learning Methods

机译:使用机器学习方法丢失登机停止信息的归责

获取原文

摘要

With the increase in population densities and environmental awareness, public transport has become an important aspect of urban life. Consequently, large quantities of transportation data are generated, and mining data from smart card use has become a standardized method to understand the travel habits of passengers. Increase in available data and computation power demands more sophisticated methods to analyze big data. Public transport datasets, however, often lack data integrity. Boarding stop information may be missing either due to imperfect acquirement processes or inadequate reporting. As a result, large quantities of observations and even complete sections of cities might be absent from the smart card database. We have developed a machine (supervised) learning method to impute missing boarding stops based on ordinal classification. In addition, we present a new metric, Pareto Accuracy, to evaluate algorithms where classes have an ordinal nature. Results are based on a case study in the city of Beer Sheva utilizing one month of data. We show that our proposed method significantly outperforms schedule-based imputation methods and can improve the accuracy and usefulness of large-scale transportation data. The implications for data imputation of smart card information is further discussed.
机译:随着人口密度和环境意识的增加,公共交通已成为城市生活的一个重要方面。因此,生成了大量的运输数据,从智能卡使用中挖掘数据已成为理解乘客旅行习惯的标准化方法。可用数据和计算功率的增加需要更复杂的方法来分析大数据。但是,公共交通数据集通常缺乏数据完整性。寄宿停止信息可能缺少由于不完美的获取过程或报告不足。因此,智能卡数据库可能缺少大量观测和城市的完整部分。我们开发了一种机器(监督)学习方法,以基于序数分类赋予缺失的登机终止。此外,我们提出了一种新的公制,帕累托准确性,以评估类具有序数性质的算法。结果是基于利用一个月的数据啤酒谢维市的案例研究。我们表明我们所提出的方法显着优于基于时间表的估算方法,可以提高大规模运输数据的准确性和有用性。进一步讨论了对数据载体的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号