International Joint Conference on Neural Networks

Learning optimization for decision tree classification of non-categorical data with information gain impurity criterion



Abstract

We consider the problem of constructing decision trees when the data is non-categorical and inherently high-dimensional. Conventional tree-growing algorithms that either rely on univariate splits or employ direct search methods to determine multivariate splitting conditions are computationally prohibitive in this setting. On the other hand, applying standard optimization methods to find locally optimal splitting conditions is obstructed by an abundance of local minima and by the discontinuities of classical goodness functions such as information gain or Gini impurity. To avoid this limitation, a method is proposed for generating a smoothed replacement for the impurity measure of a split. This makes a vast number of efficient optimization techniques applicable to finding locally optimal splits and, at the same time, decreases the number of local minima. The approach is illustrated with examples.
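A minimal sketch of the smoothing idea described above (an illustrative sigmoid relaxation, not necessarily the authors' exact construction): a hard multivariate split w·x + b > 0 can be relaxed by giving each sample a soft right-child membership σ((w·x + b)/τ). The resulting information-gain surrogate is differentiable in (w, b), so standard gradient-based optimizers apply, and as τ → 0 it approaches the hard-split gain. All function and parameter names here are assumptions for illustration.

```python
import numpy as np

def entropy(p):
    """Shannon entropy (bits) of a probability vector; clipped to avoid log(0)."""
    p = np.clip(p, 1e-12, 1.0)
    return float(-np.sum(p * np.log2(p)))

def soft_information_gain(X, y, w, b, tau=1.0):
    """Smoothed information gain of the multivariate split w.x + b > 0.

    Each sample belongs to the 'right' child with weight sigmoid((w.x + b)/tau),
    so the gain is a differentiable function of (w, b); as tau -> 0 it
    approaches the usual hard-split information gain.
    """
    z = (X @ w + b) / tau
    s = 1.0 / (1.0 + np.exp(-np.clip(z, -500, 500)))  # soft right-child membership
    classes = np.unique(y)
    n = len(y)
    parent = entropy(np.array([np.mean(y == c) for c in classes]))
    nr = s.sum()                     # soft sample count in the right child
    nl = n - nr                      # soft sample count in the left child
    p_right = np.array([s[y == c].sum() for c in classes]) / max(nr, 1e-12)
    p_left = np.array([(1.0 - s)[y == c].sum() for c in classes]) / max(nl, 1e-12)
    return parent - (nr / n) * entropy(p_right) - (nl / n) * entropy(p_left)

# Toy example: two classes separated along the first coordinate.
X = np.array([[-2.0, 0.0], [-1.0, 1.0], [1.0, -1.0], [2.0, 0.0]])
y = np.array([0, 0, 1, 1])
good = soft_information_gain(X, y, np.array([1.0, 0.0]), 0.0, tau=0.1)  # near 1.0
bad = soft_information_gain(X, y, np.array([0.0, 1.0]), 0.0, tau=0.1)   # much lower
```

Because the surrogate is smooth in (w, b), it can be fed directly to a generic optimizer (e.g. gradient ascent on the gain) at each node, instead of an exhaustive direct search over split directions; shrinking τ during optimization recovers a near-hard split.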
