首页> 外文会议>Fifth International Conference on Computer and Information Technology(CIT 2005) >A Quantitative Study of the Effect of Missing Data in Classifiers
【24h】

A Quantitative Study of the Effect of Missing Data in Classifiers

机译:定量数据对分类器影响的定量研究

获取原文
获取原文并翻译 | 示例

摘要

In data mining approaches, predictive classification has a wide range of application. However, there are always missing data in the datasets, which affect the accuracy of classifiers. This paper will investigate the influence of missing data to classifier. The sensitivity analysis of six classifiers to missing data is studied in experiments. The results indicate that, in the datasets, when the proportion of missing data exceeds 20%, they do have a huge adverse effect on the prediction accuracy. Among the six classifiers, the Naive Bayesian classifier is the least sensitive to missing data. For the popular missing data treatment methods using prediction model to handle missing data, Naive Bayesian classifier will be preferred.
机译:在数据挖掘方法中,预测分类具有广泛的应用范围。但是,数据集中总是缺少数据,这会影响分类器的准确性。本文将研究缺失数据对分类器的影响。实验研究了六个分类器对缺失数据的敏感性分析。结果表明,在数据集中,当丢失数据的比例超过20%时,它们的确会对预测准确性产生巨大的不利影响。在六个分类器中,朴素贝叶斯分类器对丢失的数据最不敏感。对于使用预测模型处理缺失数据的流行缺失数据处理方法,将首选朴素贝叶斯分类器。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号