首页> 外文期刊>Pattern recognition letters >Good methods for coping with missing data in decision trees
【24h】

Good methods for coping with missing data in decision trees

机译:解决决策树中缺失数据的好方法

获取原文
           

摘要

We propose a simple and effective method for dealing with missing data in decision trees used for classification. We call this approach "missingness incorporated in attributes" (MIA). It is very closely related to the technique of treating "missing" as a category in its own right, generalizing it for use with continuous as well as categorical variables. We show through a substantial data-based study of classification accuracy that MIA exhibits consistently good performance across a broad range of data types and of sources and amounts of missingness. It is competitive with the best of the rest (particularly, a multiple imputation EM algorithm method; EMMI) while being conceptually and computationally simpler. A simple combination of MIA and EMMI is slower but even more accurate.
机译:我们提出了一种简单有效的方法来处理用于分类的决策树中的缺失数据。我们称这种方法为“缺少属性的属性”(MIA)。它与将“缺失”本身视为一个类别的技术紧密相关,将其概括化以用于连续变量和分类变量。通过对分类准确性进行基于数据的大量研究表明,MIA在广泛的数据类型以及各种来源和缺失量中始终显示出良好的性能。它在其余方面(尤其是多重插补EM算法方法; EMMI)中的其他优点中具有优势,同时在概念和计算上更简单。 MIA和EMMI的简单组合比较慢,但更准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号