首页> 外文期刊>Information Sciences: An International Journal >A hybrid decision tree/genetic algorithm method for data mining
【24h】

A hybrid decision tree/genetic algorithm method for data mining

机译:数据挖掘的混合决策树/遗传算法方法

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses the well-known classification task of data mining, where the objective is to predict the class which an example belongs to. Discovered knowledge is expressed in the form of high-level, easy-to-interpret classification rules. In order to discover classification rules, we propose a hybrid decision tree/genetic algorithm method. The central idea of this hybrid method involves the concept of small disjuncts in data mining, as follows. In essence, a set of classification rules can be regarded as a logical disjunction of rules, so that each rule can be regarded as a disjunct. A small disjunct is a rule covering a small number of examples. Due to their nature, small disjuncts are error prone. However, although each small disjunct covers just a few examples, the set of all small disjuncts can cover a large number of examples, so that it is important to develop new approaches to cope with the problem of small disjuncts. In our hybrid approach, we have developed two genetic algorithms (GA) specifically designed for discovering rules covering examples belonging to small disjuncts, whereas a conventional decision tree algorithm is used to produce rules covering examples belonging to large disjuncts. We present results evaluating the performance of the hybrid method in 22 real-world data sets. (C) 2003 Elsevier Inc. All rights reserved.
机译:本文解决了众所周知的数据挖掘分类任务,其目的是预测示例所属的类。发现的知识以高级,易于理解的分类规则的形式表示。为了发现分类规则,我们提出了一种混合决策树/遗传算法方法。这种混合方法的中心思想涉及数据挖掘中的小析取的概念,如下所示。本质上,可以将一组分类规则视为规则的逻辑分离,因此可以将每个规则视为分离。少量的杂项是覆盖少量示例的规则。由于其性质,小的析取语容易出错。但是,尽管每个小析取词仅涵盖几个示例,但所有小析取词的集合都可以涵盖大量示例,因此,开发新方法来解决小析取词的问题非常重要。在我们的混合方法中,我们开发了两种遗传算法(GA),这些遗传算法专门用于发现覆盖属于小分离对象的规则的规则,而常规的决策树算法用于生成覆盖属于大分离对象的规则的规则。我们提出了评估混合方法在22个实际数据集中的性能的结果。 (C)2003 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号