首页> 外文学位 >A data mining framework for efficient discovery of classification rules.
【24h】

A data mining framework for efficient discovery of classification rules.

机译:一种有效发现分类规则的数据挖掘框架。

获取原文
获取原文并翻译 | 示例

摘要

Associative classification is an important research topic in data mining (DM). The thesis proposes a framework to derive accurate and interesting classification rules using the association rule mining (ARM) technique. To effectively address the rule discovery task, in the framework, two fundamental problems in the pre-processing and the post-processing components of the DM process are identified. In the pre-processing component, it is identified that the choice of the training set is an important factor in deriving good classification rules. The thesis proposes a novel technique using a genetic algorithm (GA) to find an appropriate split of a dataset into training and test sets. Using the obtained training set as the input to the ARM technique generates high accuracy classification rules. It is also identified that an algorithm (or heuristic) is required to find the best set of interesting and accurate rules from the discovered ones. In the post-processing component, the thesis proposes a pruning strategy using a GA to find the accurate interesting rules.
机译:关联分类是数据挖掘(DM)中的重要研究主题。本文提出了一种使用关联规则挖掘(ARM)技术导出准确有趣的分类规则的框架。为了有效地解决规则发现任务,在该框架中,确定了DM过程的预处理和后处理组件中的两个基本问题。在预处理组件中,可以确定训练集的选择是得出良好分类规则的重要因素。本文提出了一种使用遗传算法(GA)的新技术,该技术可以将数据集适当地分割为训练集和测试集。使用获得的训练集作为ARM技术的输入,可以生成高精度分类规则。还确定了需要一种算法(或启发式)从发现的规则中找到最佳的一组有趣且准确的规则。在后处理组件中,论文提出了一种利用遗传算法找到准确有趣规则的修剪策略。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号