首页> 外文OA文献 >Good methods for coping with missing data in decision trees
【2h】

Good methods for coping with missing data in decision trees

机译:处理决策树中缺失数据的好方法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

We propose a simple and effective method for dealing with missing data in decision trees used for classification. We call this approach 'missingness incorporated in attributes' (MIA). It is very closely related to the technique of treating 'missing' as a category in its own right, generalizing it for use with continuous as well as categorical variables. We show through a substantial data-based study of classification accuracy that MIA exhibits consistently good performance across a broad range of data types and of sources and amounts of missingness. It is competitive with the best of the rest (particularly, a multiple imputation EM algorithm method; EMMI) while being conceptually and computationally simpler. A simple combination of MIA and EMMI is slower but even more accurate.
机译:我们提出了一种简单有效的方法来处理用于分类的决策树中的缺失数据。我们称这种方法为“缺少属性的属性”(MIA)。它与将“缺失”本身视为一个类别,将其概括为可用于连续变量和分类变量的技术密切相关。通过对分类准确性进行基于数据的大量研究表明,MIA在广泛的数据类型以及各种来源和缺失量中始终显示出良好的性能。它在其余方面(尤其是多重插补EM算法方法; EMMI)中的其他优点中具有优势,同时在概念和计算上更简单。 MIA和EMMI的简单组合比较慢,但更准确。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号