首页> 外文期刊>Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies >Missing data imputation by K nearest neighbours based on grey relational structure and mutual information
【24h】

Missing data imputation by K nearest neighbours based on grey relational structure and mutual information

机译:基于灰色关联结构和互信息的K个最近邻缺失数据归因

获取原文
获取原文并翻译 | 示例
           

摘要

Treatment of missing data has become increasingly significant in scientific research and engineering applications. The classic imputation strategy based on the K nearest neighbours (KNN) has been widely used to solve the plague problem. However, former studies do not give much attention to feature relevance, which has a significant impact on the selection of nearest neighbours. As a result, biased results may appear in similarity measurements. In this paper, we propose a novel method to impute missing data, named feature weighted grey KNN (FWGKNN) imputation algorithm. This approach employs mutual information (MI) to measure feature relevance. We present an experimental evaluation for five UCI datasets in three missingness mechanisms with various missing rates. Experimental results show that feature relevance has a non-ignorable influence on missing data estimation based on grey theory, and our method is considered superior to the other four estimation strategies. Moreover, the classification bias can be significantly reduced by using our approach in classification tasks.
机译:在科学研究和工程应用中,缺失数据的处理变得越来越重要。基于K最近邻(KNN)的经典归因策略已被广泛用于解决瘟疫问题。但是,以前的研究并没有过多关注特征相关性,这对选择最近的邻居有重大影响。结果,在相似性测量中可能会出现偏差结果。在本文中,我们提出了一种新的填补缺失数据的方法,称为特征加权灰色KNN(FWGKNN)归因算法。这种方法采用互信息(MI)来度量特征相关性。我们提出了三种失踪率不同的三种失踪机制下的五个UCI数据集的实验评估。实验结果表明,基于灰色理论的特征相关性对丢失数据的估计具有不可忽视的影响,我们的方法被认为优于其他四种估计策略。此外,通过在分类任务中使用我们的方法,可以大大减少分类偏差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号