首页> 外文会议>ICCAS-SICE International Joint Conference >Fuzzy Association Rule Mining and Classifier with Chi-squared Correlation Measure using Genetic Network Programming
【24h】

Fuzzy Association Rule Mining and Classifier with Chi-squared Correlation Measure using Genetic Network Programming

机译:模糊协会规则挖掘与遗传网络编程的Chi平方相关措施的分类

获取原文

摘要

One of the most important issues in any association rule mining is the interpretation and evaluation of discovered rules. Thus, most algorithms employ the support-confidence framework for evaluating association and classification rules. Unfortunately, recent studies show that the support and confidence measures are insufficient for filtering out uninteresting association rules, for instance, even strong association rules can be uninteresting and misleading. To deal with this limitation, the support-confidence framework can be suplemented with additional interestingness measures based on statistical significance and correlation analysis. In this paper, a novel fuzzy association rule-based classification approach is proposed, where χ~2 is applied as a correlation measure. The algorithm is based on Genetic Network Programming (GNP) and discover comprehensible fuzzy association rules potentially useful for classification. GNP is an evolutionary optimization algorithm that uses directed graph structures as genes instead of strings and trees of Genetic Algorithms (GA) and Genetic Programming (GP), respectively. This feature contributes to creating quite compact programs and implicitly memorizing past action sequences. The proposed model consists of two major phases: 1) generating fuzzy class association rules by using GNP, 2) building a classifier based on the extracted fuzzy rules. In the first phase, χ~2 is used for computing the correlation of the rules to be integrated into the classifier. In the second phase, the χ~2 value is used as a weight of the rule when calculating the matching degree of the rule with new data. The performance of the proposed algorithm has been compared with other relevant algorithms and the experimental results have shown the advantages and effectiveness of the proposed model.
机译:任何关联规则挖掘中最重要的问题之一是对被发现的规则的解释和评估。因此,大多数算法采用用于评估关联和分类规则的支持置信框架。遗憾的是,最近的研究表明,支持和置信度量不足以过滤不感兴趣的关联规则,例如,即使是强大的关联规则也可能是不对和误导性的。为了处理这一限制,基于统计显着性和相关性分析,可以使用额外的有趣措施来减排支持信心框架。本文提出了一种新型模糊关联规则的分类方法,其中χ〜2被应用为相关措施。该算法基于基于遗传网络编程(GNP),并发现可用于分类的可综合模糊关联规则。 GNP是一种进化优化算法,其使用指向图结构作为基因而不是遗传算法(GA)和遗传编程(GP)的串和树木。此功能有助于创建相当紧凑的程序,并隐含地记住过去的动作序列。所提出的模型由两个主要阶段组成:1)通过使用GNP,2)基于提取的模糊规则构建分类器来生成模糊类关联规则。在第一阶段,χ〜2用于计算要集成到分类器中的规则的相关性。在第二阶段中,在计算具有新数据的规则匹配程度时,χ〜2值用作规则的权重。将所提出的算法的性能与其他相关算法进行比较,实验结果表明了所提出的模型的优点和有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号