首页> 外文会议>Machine learning(ML95) >Efficient Algorithms for Finding Multi-way Splits for Decision Trees
【24h】

Efficient Algorithms for Finding Multi-way Splits for Decision Trees

机译:查找决策树多路拆分的高效算法

获取原文
获取原文并翻译 | 示例

摘要

A number of recent papers have pointed out that binary decision trees are not always the best model for some domains. In particular, for some distributions the best way to partition a set of examples might be to find a set of intervals for a given feature, and split the examples up into several groups based on those intervals. Binary decision tree induction methods pick a single split point, i.e., they consider only bi-partitions at a node in the tree. We have developed an efficient new algorithm that computes an optimal multisplit of an interval into k sub-intervals, for any fixed k less than the number of examples. The algorithm employs a penalty function for increasing values of k to prevent it from splitting the examples into trivial partitions. Our implementation demonstrates both the efficiency of this method and the kinds of distributions for which it can produce better decision trees.
机译:最近的许多论文指出,二进制决策树并不总是某些领域的最佳模型。特别是,对于某些发行版,划分一组示例的最佳方法可能是为给定特征找到一组间隔,然后根据这些间隔将示例分成几组。二叉决策树归纳方法选择单个分割点,即,它们仅考虑树中节点处的双分区。我们已经开发了一种有效的新算法,该算法可将间隔的最佳多重分割计算为k个子间隔,对于任何小于示例数的固定k。该算法采用惩罚函数来增加k的值,以防止其将示例拆分为琐碎的分区。我们的实现既证明了这种方法的效率,又证明了它可以产生更好的决策树的分布类型。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号