首页> 外文会议>International Conference on Research and Education in Mathematics >Sensitivity of Missing Values in Classification Tree for Large Sample
【24h】

Sensitivity of Missing Values in Classification Tree for Large Sample

机译:大型样本分类树中缺失值的敏感性

获取原文
获取外文期刊封面目录资料

摘要

Missing values either in predictor or in response variables are a very common problem in statistics and data mining. Cases with missing values are often ignored which results in loss of information and possible bias. The objectives of our research were to investigate the sensitivity of missing data in classification tree model for large sample. Data were obtained from one of the high level educational institutions in Malaysia. Students' background data were randomly eliminated and classification tree was used to predict students degree classification. The results showed that for large sample, the structure of the classification tree was sensitive to missing values especially for sample contains more than ten percent missing values.
机译:在预测器或响应变量中缺少值是统计和数据挖掘中的一个非常常见的问题。具有缺失值的案例通常被忽略,这导致信息丢失和可能的偏差。我们的研究目标是研究大型样本的分类树模型中缺失数据的敏感性。数据从马来西亚的高级教育机构之一获得。学生的背景数据被随机消除,并使用分类树来预测学生学位分类。结果表明,对于大型样品,分类树的结构对缺失的值敏感,特别是对于样本含有超过10%的缺失值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号