首页> 外文期刊>Annals of data science >Analytical Split Value Calculation for Numerical Attributes in Hoeffding Trees with Misclassification-Based Impurity
【24h】

Analytical Split Value Calculation for Numerical Attributes in Hoeffding Trees with Misclassification-Based Impurity

机译:基于错误分类的杂质树木数值属性的分析分流计算

获取原文
获取原文并翻译 | 示例
           

摘要

Hoeffding tree is a method to incrementally build decision trees. A common approach to handle numerical attributes in Hoeffding trees is to represent their sufficient statistics as Gaussian distributions. Our contribution in this paper is to prove that by using Gaussian distribution as sufficient statistics and misclassification error as impurity measure, there is an analytical method to exactly calculate the best splitting values. Three different approaches for using this theorem are proposed and all three are tested on both synthetic and real datasets. The experiments suggest that this approach can create smaller trees and learn faster and achieve higher accuracy in most problems.
机译:Hoeffding树是一种逐渐构建决策树的方法。 处理Hoeffding树中数值属性的常见方法是将其充分的统计数据表示为高斯分布。 我们本文的贡献是为了证明,通过使用高斯分布作为足够的统计和错误分类误差作为杂质测量,存在分析方法来完全计算最佳分裂值。 提出了三种不同的使用本定理方法,所有三种都在合成和实际数据集上进行测试。 实验表明,这种方法可以在大多数问题中创造较小的树木并学习更快,更高的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号