Constructing Efficient Decision Trees by Using Optimized Numeric Association Rules

Abstract

We propose an extension of an entropy-based heuristic of Quinlan [Q93] for constructing a decision tree from a large database with many numeric attributes. Quinlan pointed out that his original method (as well as other existing methods) may be inefficient if any numeric attributes are strongly correlated. Our approach offers one solution to this problem. For each pair of strongly correlated numeric attributes, we compute a two-dimensional association rule with respect to these attributes and the objective attribute of the decision tree. In particular, we consider a family ℛ of grid regions in the plane associated with the pair of attributes. For each region R ∈ ℛ, the data can be split into two classes: data inside R and data outside R. We compute the region R_opt ∈ ℛ that minimizes the entropy of the splitting, and add the splitting associated with R_opt (for each pair of strongly correlated attributes) to the set of candidate tests in Quinlan's entropy-based heuristic.
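The inside/outside split described in the abstract can be illustrated with a minimal sketch. The abstract does not say which family of grid regions the authors search, so this example assumes axis-aligned rectangles over a small grid of per-cell class counts and searches them by brute force; it is not the paper's algorithm. The function names (`entropy`, `split_entropy`, `best_rectangle`) and the two-class setting are hypothetical choices made for illustration.

```python
import math


def entropy(pos, neg):
    """Binary class entropy (in bits) of a set with pos/neg tuple counts."""
    n = pos + neg
    if n == 0:
        return 0.0
    h = 0.0
    for c in (pos, neg):
        if c:
            p = c / n
            h -= p * math.log2(p)
    return h


def split_entropy(in_pos, in_neg, out_pos, out_neg):
    """Size-weighted average entropy of the inside/outside split."""
    n_in, n_out = in_pos + in_neg, out_pos + out_neg
    n = n_in + n_out
    if n == 0:
        return 0.0
    return (n_in * entropy(in_pos, in_neg) + n_out * entropy(out_pos, out_neg)) / n


def prefix_sums(grid):
    """2D prefix sums so any rectangle count is available in O(1)."""
    rows, cols = len(grid), len(grid[0])
    s = [[0] * (cols + 1) for _ in range(rows + 1)]
    for i in range(rows):
        for j in range(cols):
            s[i + 1][j + 1] = grid[i][j] + s[i][j + 1] + s[i + 1][j] - s[i][j]
    return s


def rect_count(s, r0, r1, c0, c1):
    """Sum of cells in rows r0..r1 and columns c0..c1 (inclusive)."""
    return s[r1 + 1][c1 + 1] - s[r0][c1 + 1] - s[r1 + 1][c0] + s[r0][c0]


def best_rectangle(pos, neg):
    """Brute-force search over rectangles (an assumed region family).

    pos[i][j] and neg[i][j] hold the number of tuples of each class whose
    two attribute values fall into grid cell (i, j).
    Returns (entropy, (r0, r1, c0, c1)) for the minimizing rectangle.
    """
    rows, cols = len(pos), len(pos[0])
    sp, sn = prefix_sums(pos), prefix_sums(neg)
    total_p, total_n = sp[rows][cols], sn[rows][cols]
    best = (float("inf"), None)
    for r0 in range(rows):
        for r1 in range(r0, rows):
            for c0 in range(cols):
                for c1 in range(c0, cols):
                    ip = rect_count(sp, r0, r1, c0, c1)
                    ineg = rect_count(sn, r0, r1, c0, c1)
                    e = split_entropy(ip, ineg, total_p - ip, total_n - ineg)
                    if e < best[0]:
                        best = (e, (r0, r1, c0, c1))
    return best
```

In a tree builder, the split returned by `best_rectangle` for each strongly correlated attribute pair would simply be one more candidate test, compared by entropy against the usual single-attribute threshold tests.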

Bibliographic information

Source: Twenty-Second International Conference on Very Large Data Bases (VLDB'96), 1996, pp. 146-155 (10 pages)
Conference location: Mumbai (IN)
Authors: Takeshi Fukuda; Yasuhiko Morimoto; Shinichi Morishita
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
