首页> 外文期刊>Computers, Materials & Continua >An Improved Algorithm for Mining Correlation Item Pairs
【24h】

An Improved Algorithm for Mining Correlation Item Pairs

机译:一种改进的挖掘相关项对算法

获取原文
获取原文并翻译 | 示例
           

摘要

Apriori algorithm is often used in traditional association rules mining, searching for the mode of higher frequency. Then the correlation rules are obtained by detected the correlation of the item sets, but this tends to ignore low-support high-correlation of association rules. In view of the above problems, some scholars put forward the positive correlation coefficient based on Phi correlation to avoid the embarrassment caused by Apriori algorithm. It can dig item sets with low-support but high-correlation. Although the algorithm has pruned the search space, it is not obvious that the performance of the running time based on the big data set is reduced, and the correlation pairs can be meaningless. This paper presents an improved mining algorithm with new association rules based on interestingness for correlation pairs, using an upper bound on interestingness of the supersets to prune the search space. It greatly reduces the running time, and filters the meaningless correlation pairs according to the constraints of the redundancy. Compared with the algorithm based on the Phi correlation coefficient, the new algorithm has been significantly improved in reducing the running time, the result has pruned the redundant correlation pairs. So it improves the mining efficiency and accuracy.
机译:APRIORI算法通常用于传统关联规则挖掘,搜索更高频率的模式。然后通过检测项目集的相关性获得相关规则,但这倾向于忽略关联规则的低支持高相关。鉴于上述问题,一些学者基于PHI相关提出了正相关系数,以避免由APRiori算法引起的尴尬。它可以用低支持但高相关挖掘项目集。虽然该算法修剪了搜索空间,但不明显地减少了基于大数据集的运行时间的性能,并且相关对可能毫无意义。本文提出了一种改进的挖掘算法,基于相关对的有趣性的新关联规则,使用超限对搜索空间进行修剪的上限。它大大减少了运行时间,并根据冗余的约束过滤无意义的相关对。与基于PHI相关系数的算法相比,在减少运行时,新算法显着提高,结果已修剪冗余相关对。因此它提高了采矿效率和准确性。

著录项

  • 来源
    《Computers, Materials & Continua》 |2020年第1期|337-354|共18页
  • 作者单位

    College of Electronic and Information Engineering Nanjing University of Information Science and Technology Nanjing 210044 China;

    College of Electronic and Information Engineering Nanjing University of Information Science and Technology Nanjing 210044 China;

    College of Computer and Software Nanjing University of Information Science and Technology Nanjing 210044 China;

    International Business Machines Corporation (IBM) New York USA;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Interestingness; item pairs; positive correlation; association rules; redundancy;

    机译:有趣;物品对;正相关;协会规则;冗余;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号