首页> 外文会议>Machine learning(ML95) >Increasing the performance and consistency of classification trees by using the accuracy criterion at the leaves
【24h】

Increasing the performance and consistency of classification trees by using the accuracy criterion at the leaves

机译:通过使用叶子处的准确性标准来提高分类树的性能和一致性

获取原文
获取原文并翻译 | 示例

摘要

The traditional split criteria in tree induction (Gini, Entropy and others) do not minimize the number of misclassifications at each node, and hence cannot correctly estimate the parameters of a tree, even if the underlying model can be correctly modeled by a tree procedure. We examine this effect and show that the difference in accuracy can be as much as 15precent in the worst case. We prove that using the Gini criterion, trees unbounded in size may be grown in order to correctly estimate a model. We then give a procedure that is guaranteed to give finite trees and define a modification to the standard tree growing methodology that results in improvements in predictive accuracy from 1precent to 5precent on datasets from the UCI repository.
机译:树归纳中的传统拆分标准(Gini,熵和其他)不会最小化每个节点上的错误分类数量,因此即使基础模型可以通过树过程正确建模,也无法正确估计树的参数。我们检查了这种影响,并表明在最坏的情况下,精度差异可能高达15%。我们证明,使用基尼标准,可以生长大小不受限制的树木,以便正确估计模型。然后,我们给出一个保证给出有限树的程序,并定义对标准树生长方法的修改,从而使UCI存储库中的数据集的预测准确性从1%提高到5%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号