【24h】

Experiences with a weighted decision tree learner

机译:与加权决策树学习者的经历

获取原文

摘要

Machine learning algorithms for inferring decision trees typically choose a single "best" tree to describe the training data. Recent research has shown that classification performance can be significantly improved by voting predictions of multiple, independently produced decision trees. This paper describes an algorithm, OB1, that produces a weighted sum over many possible models. Model weights are determined by the prior probability of the model, as well as the performance of the model during training. We describe an implementation of OB1 that includes all possible decision trees as well as naive Bayesian models within a single option tree. Constructing all possible decision trees is very expensive, growing exponentially in the number of attributes. However it is possible to use the internal structure of the option tree to avoid recomputing values. In addition, the current implementation allows the option tree to be depth bounded. OB1 is compared with a number of other decision tree and instance based learning algorithms using a selection of data sets from the UCI repository and a maximum option tree depth of three attributes. Both information gain and percentage correct are used for the Comparison. For the information gain measure OB1 performs significantly better than the other algorithms. When using percentage correct OB1 is significantly better than all the algorithms except naive Bayes and boosted C5.0 which perform slightly worse than OB1.
机译:用于推断决策树的机器学习算法通常选择单个“最佳”树来描述培训数据。最近的研究表明,通过投票预测,可以显着改善分类性能,这些决策树的预测。本文介绍了一种算法OB1,它在许多可能的模型上产生加权和。模型权重由模型的现有概率确定,以及在训练期间模型的性能。我们描述了OB1的实现,其中包括所有可能的决策树以及单个选项树中的幼稚贝叶斯模型。构建所有可能的决策树非常昂贵,在属性的数量中呈指数级增长。但是,可以使用选项树的内部结构来避免重新计算值。此外,当前实现允许选项树是深度界限的。将OB1与许多其他决策树和基于实例的学习算法进行比较,使用来自UCI存储库的数据集和三个属性的最大选项树深度的数据集。信息增益和百分比正确地用于比较。对于信息,GAIL MATION OB1显着优于其他算法。当使用百分比正确的OB1,OB1明显优于除天真贝叶斯之外的所有算法,并提升C5.0,其比OB1略差。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号