Constructing Efficient Decision Trees by Using Optimized Numeric Association Rules

机译：通过使用优化的数字关联规则构建有效的决策树

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We propose an extension of an entropy-based heuristic of Quinlan [Q93] for constructing a decision tree from a large database with many numeric attributes. Quinlan pointed out that his original method (as well as other existing methods) may be inefficient if any numeric attributes are strongly correlated. Our approach offers one solution to this problem. For each pair of numeric attributes with strong correlation, we compute a two-dimensional association rule with respect to these attributes and the objective attribute of the decision tree. In particular, we consider a family R of grid-regions in the plane associated with the pair of attributes. For R is not an element of R, the data can be split into two classes: data inside R and data outside R. We compute the region R_(opt) is not an element of R that minimizes the entropy of the splitting, and add the splitting associated with R_(opt) (for each pair of strongly correlated attributes) to the set of candidate tests in Quinlan's entropy-based heuristic.

机译：我们提出了扩展Quinlan的基于熵的启发式，用于从具有许多数字属性的大型数据库构建决策树。昆兰指出，如果任何数字属性强烈相关，他的原始方法（以及其他现有方法）可能效率低下。我们的方法为此问题提供了一个解决方案。对于具有强相关性的每对数字属性，我们对这些属性和决策树的目标属性计算二维关联规则。特别是，我们考虑与一对属性相关联的平面中的网格区域r。对于R不是R的元素，数据可以分为两个类：R和数据之外的数据内部。我们计算区域R_（opt）不是最小化分裂熵的元素，并添加与r_（选择）（对每对强相关的属性）相关联的分裂到奎纳兰的熵的启发式中的候选测试集。

著录项

来源
《International conference on very large data bases》|1996年||共10页
会议地点
作者
Takeshi Fukuda; Yasuhiko Morimoto; Shinichi Morishita;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类各种专用数据库;
关键词

相似文献

外文文献
中文文献
专利

1. Privacy Preserving Optimized Rules Mining from Decision Tables and Decision Trees [J] . Ahmed Saeed Alzahrani, Muhammad Shuaib Qureshi Indian Journal of Science and Technology . 2012,第6期

机译：从决策表和决策树中保护隐私的优化规则挖掘
2. Efficient pyrolysis of ginkgo biloba leaf residue and pharmaceutical sludge (mixture) with high production of clean energy: Process optimization by particle swarm optimization and gradient boosting decision tree algorithm [J] . Bioresource Technology: Biomass, Bioenergy, Biowastes, Conversion Technologies, Biotransformations, Production Technologies . 2020,第期

机译：高效清洁能源的高效热解热解法和药物污泥（混合物）：通过粒子群优化和梯度升压决策树算法的过程优化
3. Incremental Optimization Mechanism for Constructing a Decision Tree in Data Stream Mining [J] . Hang Yang, Simon Fong Mathematical Problems in Engineering . 2013,第pta2期

机译：数据流挖掘中构建决策树的增量优化机制
4. Constructing Efficient Decision Trees by Using Optimized Numeric Association Rules [C] . Takeshi Fukuda, Yasuhiko Morimoto, Shinichi Morishita International conference on very large data bases . 1996

机译：通过使用优化的数字关联规则构建有效的决策树
5. Developing a genetic algorithm to construct efficient binary decision trees. [D] . Nwosisi, Christopher. 2010

机译：开发遗传算法以构建有效的二元决策树。
6. Seven-Spot Ladybird Optimization: A Novel and Efficient Metaheuristic Algorithm for Numerical Optimization [O] . Peng Wang, Zhouquan Zhu, Shuai Huang 2013

机译：七点瓢虫优化：一种新颖高效的数值启发式数值优化算法
7. Optimization Analysis of Improved Association Rules Based on Decision Tree and Data Clustering [O] . Qing Tan 2018

机译：基于决策树和数据群集的改进关联规则的优化分析

Constructing Efficient Decision Trees by Using Optimized Numeric Association Rules

摘要

著录项

相似文献

相关主题

期刊订阅