首页> 外文会议>ICCCI 2010;International conference on computer and computational intelligence >Classification Performance Measure Using Missing Insurance Data: A Comparison Between Supervised Learning Models
【24h】

Classification Performance Measure Using Missing Insurance Data: A Comparison Between Supervised Learning Models

机译:使用缺失的保险数据进行分类绩效评估:监督学习模型之间的比较

获取原文

摘要

The aim of this paper is to measure and compare the classification performance of five supervised learning models in the presence of missing data. These models are the support vector machines, naive Bayes, k-NN, Ripper and the Iogistic discriminant analysis algorithm. Five different proportions of “missingness” are artificially simulated on a test data set obtained from the UCI repository. For each simulated proportion, aIJ model are tested against the data to access the impact on accuracy of the classification. The classification involves determining whether a group of customers are likely to have an insurance cover or not. This is done by scrutinizing the attributes associated with the customers. The models are built using completely observable data. The results show that with a small proportion of missing data, all models perform well with high accuracies. However, as the number of missing values increase, the performance drops with some models experiencing a greater impact on classification accuracy than others. The logistic discriminant analysis algorithm performs better overall, but the naive Bayes model shows more resilience when there is an increase in proportions of missing data in the test data set.
机译:本文的目的是测量和比较在缺少数据的情况下五个监督学习模型的分类性能。这些模型是支持向量机,朴素贝叶斯,k-NN,Ripper和Iogistic判别分析算法。在从UCI存储库获得的测试数据集上,人工模拟了“缺失”的五个不同比例。对于每个模拟比例,均会针对数据测试aIJ模型,以获取对分类准确性的影响。分类涉及确定一组客户是否可能拥有保险。这是通过仔细检查与客户相关联的属性来完成的。这些模型是使用完全可观察的数据构建的。结果表明,在丢失数据的比例很小的情况下,所有模型均具有较高的精度,并且表现良好。但是,随着缺失值数量的增加,某些模型对分类精度的影响要比其他模型大,因此性能下降。逻辑判别分析算法总体上表现更好,但是当测试数据集中缺失数据的比例增加时,朴素的贝叶斯模型显示出更大的弹性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号