A Novel Index Measure Imputation Algorithm for Missing Data Values: A Machine Learning Approach


Abstract

Missing data in real-world datasets plays a significant role in real-time data mining, and the problem becomes more complex in large databases. Missing values influence both the features of a dataset and its class attributes, and so affect the predictive accuracy of classifiers. Over the last decade, many researchers have proposed techniques for dealing with missing attribute values in databases with homogeneous and/or numeric attributes. In this work, we propose a new index measure for an imputation algorithm for missing attribute values; the measure computes the similarity between any two typical elements in the dataset. It can be applied to any dataset, whether its attributes are nominal and/or real-valued. The proposed algorithm is evaluated through extensive experiments and compared with the KNNI, SVMI, WKNNI, KMI and FKMI algorithms. The results show that the proposed algorithm outperforms the existing imputation algorithms in terms of classification accuracy, and that our decision tree algorithm employs highly accurate decision rules.
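For orientation, the following is a minimal sketch of similarity-based imputation in the style of a KNNI baseline, one of the methods the abstract compares against. It is not the authors' index measure, which the abstract does not define. The distance used here (range-normalised absolute difference for numeric attributes, 0/1 mismatch for nominal ones) is an assumption chosen only to show how a single measure can cover mixed nominal and real-valued data, and the names `MISSING`, `attribute_ranges`, `distance` and `knn_impute` are hypothetical.

```python
from collections import Counter

MISSING = None  # assumed marker for a missing attribute value


def attribute_ranges(rows, numeric_cols):
    """Range (max - min) of each numeric column over its observed values."""
    ranges = {}
    for c in numeric_cols:
        vals = [r[c] for r in rows if r[c] is not MISSING]
        # Fall back to 1.0 for empty or constant columns to avoid dividing by zero.
        ranges[c] = (max(vals) - min(vals)) if vals else 1.0
        ranges[c] = ranges[c] or 1.0
    return ranges


def distance(a, b, numeric_cols, ranges):
    """Mean per-attribute distance over attributes observed in both rows."""
    total, used = 0.0, 0
    for c, (x, y) in enumerate(zip(a, b)):
        if x is MISSING or y is MISSING:
            continue  # skip attributes that either row lacks
        if c in numeric_cols:
            total += abs(x - y) / ranges[c]  # numeric: normalised difference
        else:
            total += 0.0 if x == y else 1.0  # nominal: simple mismatch
        used += 1
    return total / used if used else float("inf")


def knn_impute(rows, numeric_cols, k=3):
    """Return a copy of rows with each missing value filled from the k nearest donors."""
    ranges = attribute_ranges(rows, numeric_cols)
    completed = [list(r) for r in rows]
    for i, row in enumerate(rows):
        for c, value in enumerate(row):
            if value is not MISSING:
                continue
            # Donors are the other rows that have this attribute observed.
            donors = sorted(
                (distance(row, other, numeric_cols, ranges), j)
                for j, other in enumerate(rows)
                if j != i and other[c] is not MISSING
            )[:k]
            neighbours = [rows[j][c] for _, j in donors]
            if not neighbours:
                continue  # nothing to impute from; leave the gap
            if c in numeric_cols:
                completed[i][c] = sum(neighbours) / len(neighbours)  # mean
            else:
                completed[i][c] = Counter(neighbours).most_common(1)[0][0]  # mode
    return completed
```

Calling `knn_impute(rows, numeric_cols={0, 2}, k=2)` on a list of equal-length rows fills numeric gaps with the neighbours' mean and nominal gaps with their most frequent value, leaving the input rows unmodified.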


Bibliographic record

Source: International Conference on Computational Intelligence and Computing Research, 2012, 7 pages
Authors: G. Madhu; T. V. Rajinikanth
Original format: PDF
Chinese Library Classification: TP301.4-532
Keywords: classification; decision tree; index measure; missing values


Similar literature


1. Deep Learning Approach for Imputation of Missing Values in Actigraphy Data: Algorithm Development Study [J]. Jong-Hwan Jang, Junggu Choi, Hyun Woong Roh. JMIR mHealth and uHealth, 2020, No. 7.
2. A New Imputation Algorithm Based Approach for Missing Attribute Values in Databases: An Experimental Approach [J]. Madhu G. International Journal of Artificial Intelligence and Knowledge Discovery, 2013, No. 4.
3. A novel approach for imputation of missing continuous attribute values in databases using genetic algorithm [J]. R. Devi Priya, S. Kuppuswami. International Journal of Information Technology and Management, 2015, No. 2a3.
4. A novel index measure imputation algorithm for missing data values: A machine learning approach [C]. Madhu G., Rajinikanth T.V. 2012 IEEE International Conference on Computational Intelligence & Computing Research, 2012.
5. Missing covariates in causal inference matching: Statistical imputation using machine learning and evolutionary search algorithms [D]. Hurley, Landon. 2017.
6. Providing an imputation algorithm for missing values of longitudinal data using Cuckoo search algorithm: A case study on cervical dystonia [O]. Amin Golabpour, Kobra Etminani, Hassan Doosti. 2017.
7. The impact of imputation procedures with machine learning methods on the performance of classifiers: An application to coronary artery disease data including missing values [O]. Jale Bektas, Turgay Ibrikci, Ismail Turkay Ozcan. 2018.
