首页> 外文会议>International conference on computer science and it applications >Effective Pre-processing Methods with DTG Big Data by Using MapReduce Techniques
【24h】

Effective Pre-processing Methods with DTG Big Data by Using MapReduce Techniques

机译:通过使用MapReduce技术的DTG大数据有效预处理方法

获取原文

摘要

A huge amount of sensing data is generated by a large number of pervasive IoT devices. In order to find a meaningful information from the big data, pre-processing is essential, in which many outlier data need to be removed because those are deteriorated as time passes. In this paper, big data pre-processing methods are investigated and proposed. To evaluate the pre-processing methods for accurate analysis, we use collection of digital tachograph (DTG) data. We obtained DTG sensing data of six-thousand driving vehicles over a year. We studied five kinds of pre-processing methods: filtering ranges, excluding meaningless values, comparing filters from variables, applying statistical techniques, and finding driving patterns. In addition, we developed MapReduce programming using a Hadoop ecosystem, and deployed a big data to perform pre-processing analysis. Out of the pre-processing steps, we confirmed the proportion of DTG sensing data including any errors is up to 27.09 %. In addition, we approved that outlier data can be well detected, which is difficult to detect through simple range error pre-processing.
机译:通过大量普遍的物联网设备生成大量的感测数据。为了从大数据中找到有意义的信息,预处理是必不可少的,其中需要删除许多异常数据,因为随着时间的推移而恶化。在本文中,研究了大数据预处理方法并提出。为了评估准确分析的预处理方法,我们使用数字Tachograph(DTG)数据的集合。我们在一年内获得了DTG传感数据的六千驾驶车辆。我们研究了五种预处理方法:过滤范围,不包括无意义的值,比较来自变量的过滤器,应用统计技术,找到驾驶模式。此外,我们使用Hadoop生态系统开发了MapReduce编程,并部署了大数据以进行预处理分析。除了预处理步骤之外,我们确认了DTG传感数据的比例,包括任何误差高达27.09%。此外,我们批准了可以通过简单的范围错误预处理来批准良好检测到异常数据,这很难检测到。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号