A novel index measure imputation algorithm for missing data values: A machine learning approach

机译：缺失数据值的新型索引度量插补算法：一种机器学习方法

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

The problem of missing data in the real world datasets has very significant role in the real time data mining process and becomes more complex in large databases. The presence of missing values influences data set features and the class attributes, thus affecting the predictive accuracies of the classifiers. For the last one decade, many researchers have come out with different techniques for dealing with missing attribute values in databases with homogeneous and/or numeric attributes. In this research work, we proposed a new indexing measure to the imputation algorithm for missing data values of the attributes to compute the similarity measure between any two typical elements in the dataset. It can also be applied on any dataset be it a nominal and/or real. The proposed algorithm is evaluated by extensive experiments and comparison with KNNI, SVMI, WKNNI, KMI and FKMI algorithms. The results showed that the proposed algorithm has better performance than the existing imputation algorithms in terms of classification accuracy and also our decision tree algorithm employs highly accurate decision rules.

机译：现实世界数据集中的数据丢失问题在实时数据挖掘过程中起着非常重要的作用，并且在大型数据库中变得更加复杂。缺失值的存在会影响数据集功能和类属性，从而影响分类器的预测准确性。在过去的十年中，许多研究人员提出了不同的技术来处理具有均质和/或数值属性的数据库中缺少的属性值。在这项研究工作中，我们为属性缺失数据值的插补算法提出了一种新的索引度量，以计算数据集中任意两个典型元素之间的相似性度量。它也可以应用于标称和/或实数的任何数据集。通过广泛的实验对提出的算法进行了评估，并与KNNI，SVMI，WKNNI，KMI和FKMI算法进行了比较。结果表明，与分类算法相比，该算法具有更好的分类精度，并且决策树算法采用了高精度的决策规则。

著录项

来源
《2012 IEEE International Conference on Computational Intelligence amp; Computing Research》|2012年|p.1-7|共7页
会议地点 Coimbatore(IN)
作者
Madhu G.; Rajinikanth T.V.;
展开▼
作者单位

Dept of Information Technology, VNR VJIET, Hyderabad, Andhra Pradesh, 500090, India;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算机的应用;
关键词
classification; decision tree; index measure; missing values;

机译：分类;决策树;指标测度;缺失值;;

相似文献

外文文献
中文文献
专利

1. Deep Learning Approach for Imputation of Missing Values in Actigraphy Data: Algorithm Development Study [J] . Jong-Hwan Jang, Junggu Choi, Hyun Woong Roh, JMIR mHealth and uHealth . 2020,第7期

机译：激光数据缺失值归咎的深度学习方法：算法开发研究
2. A New Imputation Algorithm Based Approach for Missing Attribute Values in Databases: An Experimental Approach [J] . Madhu G International Journal of Artificial Intelligence and Knowledge Discovery . 2013,第4期

机译：一种基于归因算法的数据库缺失属性值的新方法：一种实验方法
3. A novel approach for imputation of missing continuous attribute values in databases using genetic algorithm [J] . R. Devi Priya, S. Kuppuswami International journal of infomation technology and management . 2015,第2a3期

机译：利用遗传算法对数据库中连续属性值缺失进行估算的新方法
4. A Novel Index Measure Imputation Algorithm for Missing Data Values: A Machine Learning Approach [C] . G.Madhu, T.V.Rajinikanth International Conference on Computational Intelligence and Computing Research . 2012

机译：缺少数据值的新索引测量估算算法：机器学习方法
5. Missing covariates in causal inference matching: Statistical imputation using machine learning and evolutionary search algorithms [D] . Hurley, Landon. 2017

机译：因果推理匹配中缺少协变量：使用机器学习和进化搜索算法进行统计插补
6. Providing an imputation algorithm for missing values of longitudinal data using Cuckoo search algorithm: A case study on cervical dystonia [O] . Amin Golabpour, Kobra Etminani, Hassan Doosti, 2017

机译：使用杜鹃搜索算法为纵向数据的缺失值提供插补算法：以宫颈肌张力障碍为例
7. The impact of imputation procedures with machine learning methods on the performance of classifiers: An application to coronary artery disease data including missing values [O] . Jale Bektas, Turgay Ibrikci, Ismail Turkay Ozcan 2018

机译：用机器学习方法对分类器性能的估算方法的影响：冠状动脉疾病数据的应用，包括缺失值

A novel index measure imputation algorithm for missing data values: A machine learning approach

摘要

著录项

相似文献

相关主题

期刊订阅