...
首页> 外文期刊>Knowledge and Data Engineering, IEEE Transactions on >Missing Value Estimation for Mixed-Attribute Data Sets
【24h】

Missing Value Estimation for Mixed-Attribute Data Sets

机译:混合属性数据集的缺失值估计

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Missing data imputation is a key issue in learning from incomplete data. Various techniques have been developed with great successes on dealing with missing values in data sets with homogeneous attributes (their independent attributes are all either continuous or discrete). This paper studies a new setting of missing data imputation, i.e., imputing missing data in data sets with heterogeneous attributes (their independent attributes are of different types), referred to as imputing mixed-attribute data sets. Although many real applications are in this setting, there is no estimator designed for imputing mixed-attribute data sets. This paper first proposes two consistent estimators for discrete and continuous missing target values, respectively. And then, a mixture-kernel-based iterative estimator is advocated to impute mixed-attribute data sets. The proposed method is evaluated with extensive experiments compared with some typical algorithms, and the result demonstrates that the proposed approach is better than these existing imputation methods in terms of classification accuracy and root mean square error (RMSE) at different missing ratios.
机译:缺少数据归因是从不完整数据中学习的关键问题。在处理具有同类属性(它们的独立属性都是连续的或离散的)的数据集中的缺失值方面,已经开发了各种技术,并取得了巨大的成功。本文研究了缺失数据插补的一种新设置,即在具有异构属性(它们的独立属性为不同类型)的数据集中插补缺失数据,称为插补混合属性数据集。尽管此设置中有许多实际应用程序,但没有为估算混合属性数据集而设计的估计器。本文首先针对离散和连续缺失目标值分别提出了两个一致的估计量。然后,提倡基于混合核的迭代估计器来估算混合属性数据集。通过与大量典型算法进行比较,通过大量实验对提出的方法进行了评估,结果表明,在分类丢失率和均方根误差(RMSE)方面,提出的方法优于现有的归因方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号