首页> 外文期刊>International Journal for Computers and Their Applications >Improvement of Apriori Algorithm for Missing Itemset Identification and Faster Execution
【24h】

Improvement of Apriori Algorithm for Missing Itemset Identification and Faster Execution

机译:缺少项目集识别和更快的执行的APRiori算法的改进

获取原文
获取原文并翻译 | 示例
           

摘要

Association rule mining (ARM) is an important data mining strategy to analyze the relationship among the items. Apriori algorithm is the most used approach to implement association rule mining. We identified two major issues of Apriori. Apriori follows an iterative approach consisting of multiple database scans for searching frequent itemsets that satisfy certain threshold criteria. The same predefined threshold value is maintained throughout the repetitive stages of the Apriori method and hence there is a huge possibility of discarding higher-order itemsets, though all of its sub-itemsets are frequent. Some of these ignored itemsets if used intelligently have a huge potential for business value addition. Furthermore, in the Apriori procedure, an exponential number of computations is required to check whether an item is important or not and that makes the entire pattern mining system costly. In this study first, we identify the hidden business-critical item sets that are otherwise ignored in the traditional Apriori process. Furthermore, a novel approach is proposed here to evaluate whether an item is interesting or not at a considerably reduced computational time.
机译:关联规则挖掘(ARM)是分析物品之间关系的重要数据挖掘策略。 APRIORI算法是实现关联规则挖掘最常用的方法。我们确定了两个Apriori的主要问题。 Apriori遵循一个迭代方法,包括多个数据库扫描,用于搜索满足某些阈值标准的频繁项目集。在APRIORI方法的重复阶段中保持相同的预定义阈值,因此丢弃了丢弃高阶项集的巨大可能性,尽管其所有子项集频繁。其中一些被忽略的项目集,如果使用智能地具有巨大的业务价值潜力。此外,在APRIORI过程中,需要指数的计算数来检查项目是否重要,并且使整个模式挖掘系统成本高昂。在这项研究中,我们确定了传统的Apriori过程中忽略的隐藏业务关键项集。此外,这里提出了一种新的方法来评估项目是否有趣,或者在显着降低的计算时间内。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号