首页> 外文会议>International Conference on Advanced Computing >Imputing missing values using Inverse Distance Weighted Interpolation for time series data
【24h】

Imputing missing values using Inverse Distance Weighted Interpolation for time series data

机译:使用时间距离数据的反距离加权插值插值缺失值

获取原文

摘要

Data mining is the process of analyzing and retrieving meaningful information from a database. Temporal data mining deals with time stamped data. In the real world, temporal data obtained may contain noisy, inconsistent data and in most cases the data may be missing; hence data preprocessing is one of the important steps that has to be carried out in data mining. Missing values may generate biased results and affect the accuracy of classification. In order to overcome this it is necessary to impute the missing values based on other information in the dataset. The work focuses on imputing missing values using Inverse Distance Weighted Interpolation method which best suits for data sampled at uneven intervals of time. This method assigns values to unknown points from a weighted sum of values of known points. Machine learning techniques applied to the imputed dataset will give better accuracy than that of the incomplete dataset.
机译:数据挖掘是分析和检索数据库中有意义的信息的过程。时态数据挖掘处理带有时间戳的数据。在现实世界中,获得的时间数据可能包含嘈杂的,不一致的数据,并且在大多数情况下,这些数据可能会丢失;因此,数据预处理是数据挖掘中必须执行的重要步骤之一。缺少值可能会产生有偏差的结果,并影响分类的准确性。为了克服这个问题,有必要根据数据集中的其他信息来估算缺失值。这项工作着重于使用反距离加权插值法估算缺失值,该方法最适合于在不均匀的时间间隔内采样的数据。该方法根据已知点的值的加权总和将值分配给未知点。应用于估算数据集的机器学习技术将比不完整数据集提供更高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号