首页> 外文学位 >A new approach of top-down induction of decision trees for knowledge discovery.
【24h】

A new approach of top-down induction of decision trees for knowledge discovery.

机译:自上而下归纳决策树以进行知识发现的新方法。

获取原文
获取原文并翻译 | 示例

摘要

Top-down induction of decision trees is the most popular technique for classification in the field of data mining and knowledge discovery. Quinlan developed the basic induction algorithm of decision trees, ID3 (1984), and extended to C4.5 (1993). There is a lot of research work for dealing with a single attribute decision-making node (so-called the first-order decision) of decision trees. Murphy and Pazzani (1991) addressed about multiple-attribute conditions at decision-making nodes. They show that higher order decision-making generates smaller decision trees and better accuracy. However, there always exist NP-complete combinations of multiple-attribute decision-makings.; We develop a new algorithm of second-order decision-tree inductions (SODI) for nominal attributes. The induction rules of first-order decision trees are combined by 'AND' logic only, but those of SODI consist of 'AND', 'OR', and 'OTHERWISE' logics. It generates more accurate results and smaller decision trees than any first-order decision tree inductions.; Quinlan used information gains via VC-dimension (Vapnik-Chevonenkis; Vapnik, 1995) for clustering the experimental values for each numerical attribute. However, many researchers have discovered the weakness of the use of VC-dim analysis. Bennett (1997) sophistically applies support vector machines (SVM) to decision tree induction. We suggest a heuristic algorithm (SVMM; SVM for Multi-category) that combines a TDIDT scheme with SVM. In this thesis it will be also addressed how to solve multiclass classification problems.; Our final goal for this thesis is IDSS (Induction of Decision Trees using SODI and SVMM). We will address how to combine SODI and SVMM for the construction of top-down induction of decision trees in order to minimize the generalized penalty cost.
机译:自上而下的决策树归纳是数据挖掘和知识发现领域中最流行的分类技术。 Quinlan开发了决策树的基本归纳算法ID3(1984),并扩展到C4.5(1993)。对于处理决策树的单个属性决策节点(所谓的一阶决策),有很多研究工作。 Murphy和Pazzani(1991)研究了决策节点的多属性条件。他们表明,较高阶的决策产生较小的决策树和更高的准确性。但是,总是存在多属性决策的NP完全组合。我们针对名义属性开发了一种新的二阶决策树归纳算法(SODI)。一阶决策树的归纳规则仅由“ AND”逻辑组合,而SODI的归纳规则由“ AND”,“ OR”和“ OTHERWISE”逻辑组成。与任何一阶决策树归纳法相比,它产生更准确的结果和更小的决策树。昆兰(Quinlan)使用通过VC维度获得的信息(Vapnik-Chevonenkis; Vapnik,1995)将每个数值属性的实验值聚类。但是,许多研究人员发现使用VC-dim分析的缺点。 Bennett(1997)巧妙地将支持向量机(SVM)应用于决策树归纳。我们建议结合TDIDT方案和SVM的启发式算法(SVMM;用于多类别的SVM)。本文还将探讨如何解决多类分类问题。本文的最终目标是IDSS(使用SODI和SVMM的决策树归纳)。我们将解决如何将SODI和SVMM结合起来以构建自顶向下的决策树,以最大程度地减少广义的惩罚成本。

著录项

  • 作者

    Lee, Jun-Youl.;

  • 作者单位

    Iowa State University.$bIndustrial and Manufacturing Systems Engineering.;

  • 授予单位 Iowa State University.$bIndustrial and Manufacturing Systems Engineering.;
  • 学科 Engineering Industrial.
  • 学位 Ph.D.
  • 年度 2008
  • 页码 186 p.
  • 总页数 186
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 一般工业技术;
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号