A New Paradigm for Development of Data Imputation Approach for Missing Value Estimation

Madhu G; Nagachandrika G

首页> 外文期刊>International Journal of Electrical and Computer Engineering >A New Paradigm for Development of Data Imputation Approach for Missing Value Estimation

【24h】

A New Paradigm for Development of Data Imputation Approach for Missing Value Estimation

机译：缺失值估计的数据插补方法开发的新范例

获取原文

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

团队文献服务 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many real-world applications encountered a common issue in data analysis is the presence of missing data value and challenging task in many applications such as wireless sensor networks, medical applications and psychological domain and others. Learning and prediction in the presence of missing value can be treacherous in machine learning, data mining and statistical analysis. A missing value can signify important information about dataset in the mining process. Handling missing data value is a challenging task for the data mining process. In this paper, we propose new paradigm for the development of data imputation method for missing data value estimation based on centroids and the nearest neighbours. Firstly, identify clusters based on the k-means algorithm and calculate centroids and the nearest neighbour data records. Secondly, the nearest distances from complete dataset as well as incomplete dataset from the centroids and estimated the nearest data record which tends to be curse dimensionality. Finally, impute the missing value based nearest neighbour record using statistical measure called z-score. The experimental study demonstrates strengthen of the proposed paradigm for the imputation of the missing data value estimation in dataset. Tests have been run using different types of datasets in order to validate our approach and compare the results with other imputation methods such as KNNI, SVMI, WKNNI, KMI and FKNNI. The proposed approach is geared towards maximizing the utility of imputation with respect to missing data value estimation.

机译：许多现实世界的应用程序在数据分析中遇到一个共同的问题，即无线传感器网络，医疗应用程序和心理领域等许多应用程序中缺少数据值和具有挑战性的任务。在缺少价值的情况下进行学习和预测在机器学习，数据挖掘和统计分析中可能是危险的。缺失值可能表示有关挖掘过程中数据集的重要信息。对于数据挖掘过程来说，处理缺失的数据值是一项艰巨的任务。在本文中，我们提出了一种新的范式，用于开发基于质心和最近邻的缺失数据值估计的数据插补方法。首先，基于k均值算法识别聚类，并计算质心和最近的邻居数据记录。其次，距完整数据集的最近距离以及距质心的不完整数据集，并估计最近的数据记录，该记录往往是诅咒维数。最后，使用称为z得分的统计量度基于缺失值的最近邻居记录。实验研究表明，对于数据集中的缺失数据值估计的推论，该提议范式得到了加强。为了验证我们的方法并将结果与其他插补方法（例如KNNI，SVMI，WKNNI，KMI和FKNNI）进行比较，已使用不同类型的数据集进行了测试。所提出的方法旨在针对缺失数据值估计最大化估算的效用。

著录项

来源
《International Journal of Electrical and Computer Engineering 》 |2016年第6期| 共7页
作者
Madhu G; Nagachandrika G;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类计算机的应用 ;
关键词

相似文献

外文文献
中文文献
专利

1. A hybrid approach to integrate fuzzy C-means based imputation method with genetic algorithm for missing traffic volume data estimation [J] . Jinjun Tang, Guohui Zhang, Yinhai Wang, Transportation research . 2015 ,第feba期

机译：一种基于模糊C均值的插补方法与遗传算法相结合的混合方法
2. Accurate Tree-based Missing Data Imputation and Data Fusion within the Statistical Learning Paradigm [J] . Antonio D’Ambrosio, Massimo Aria, Roberta Siciliano Journal of Classification . 2012 ,第2期

机译：统计学习范式中基于树的准确缺失数据插补和数据融合
3. Accurate Tree-based Missing Data Imputation and Data Fusion within the Statistical Learning Paradigm [J] . DAmbrosio A., Aria M., Siciliano R. Journal of classification . 2012 ,第2期

机译：基于树的准确的树缺失数据避难和数据融合在统计学习范式范围内
4. A Missing Data Imputation Approach Using Clustering and Maximum Likelihood Estimation [C] . Muammer ALBAYRAK, Kemal TURHAN, Burcin KURT Medical Technologies National Congress . 2017

机译：使用聚类和最大似然估计的缺少数据估算方法
5. Improve Software Defect Estimation with Six Sigma Defect Measures: Empirical Studies with Imputation Techniques on ISBSG Data Repository with a High Ratio of Missing Data [D] . Almakadmeh, Mhammed. 2017

机译：提高六种Sigma缺陷措施的软件缺陷估算：具有高比例的ISBSG数据储存中缺货技术的实证研究
6. A new analytical framework for missing data imputation and classification with uncertainty: Missing data imputation and heart failure readmission prediction [O] . Zhiyong Hu, Dongping Du 2020

机译：一种新的分析框架用于缺少数据避难和不确定性分类：缺少数据归档和心力衰竭入读预测
7. A New Paradigm for Development of Data Imputation Approach for Missing Value Estimation [O] . Madhu G, Nagachandrika G 2016

机译：用于缺少价值估计的数据归档方法的新范式

A New Paradigm for Development of Data Imputation Approach for Missing Value Estimation

摘要

著录项

相似文献

相关主题

期刊订阅