首页> 外文会议>International Conference on Computational Intelligence and Computing Research >A Novel Index Measure Imputation Algorithm for Missing Data Values: A Machine Learning Approach
【24h】

A Novel Index Measure Imputation Algorithm for Missing Data Values: A Machine Learning Approach

机译:缺少数据值的新索引测量估算算法:机器学习方法

获取原文

摘要

The problem of missing data in the real world datasets has very significant role in the real time data mining process and becomes more complex in large databases. The presence of missing values influences data set features and the class attributes, thus affecting the predictive accuracies of the classifiers. For the last one decade, many researchers have come out with different techniques for dealing with missing attribute values in databases with homogeneous and/or numeric attributes. In this research work, we proposed a new indexing measure to the imputation algorithm for missing data values of the attributes to compute the similarity measure between any two typical elements in the dataset. It can also be applied on any dataset be it a nominal and/or real. The proposed algorithm is evaluated by extensive experiments and comparison with KNNI, SVMI, WKNNI, KMI and FKMI algorithms. The results showed that the proposed algorithm has better performance than the existing imputation algorithms in terms of classification accuracy and also our decision tree algorithm employs highly accurate decision rules.
机译:在真实数据挖掘过程中,真实世界数据集中缺少数据的问题在实时数据挖掘过程中具有非常重要的作用,并且在大型数据库中变得更加复杂。缺失值的存在影响数据集特征和类属性,从而影响分类器的预测精度。对于最后一个十年来,许多研究人员已经出现了不同的技术,用于处理具有同类和/或数字属性的数据库中的缺失的属性值。在这项研究工作中,我们提出了一种新的索引测量来缺少属性的数据值的撤消算法,以计算数据集中的任何两个典型元素之间的相似度量。它也可以应用于任何数据集是标称和/或真实的。所提出的算法通过广泛的实验和knni,SVMI,WKNNI,KMI和FKMI算法进行评估。结果表明,该算法在分类准确性方面具有比现有估算算法更好的性能,并且我们的决策树算法采用了高度准确的决策规则。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号