一种新的基于FP_Growth的频繁项目集并行挖掘算法

孙鸿艳; 吉根林

首页> 中文期刊>南京师大学报（自然科学版） >一种新的基于FP_Growth的频繁项目集并行挖掘算法

一种新的基于FP_Growth的频繁项目集并行挖掘算法

开具论文收录证明 >>

期刊封面封底目录下载 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

频繁项目集挖掘用于发现项目之间的关联规则.为了高效求解面向大数据的频繁项目集,本文提出一种新的基于HP_Growth的频繁项目集并行挖掘算法NPFP_Growth(New Parallel algorithm based on FP_Growth),该算法对频繁模式树的存储结构进行改进,基于Map/Reduce并行计算模型,利用HDFS实现数据存储,在各自计算节点上构造局部频繁模式树,求解该局部频繁模式树中每个分支的最长全局频繁项目集;对于全局非频繁项目集,计算其支持数,发送至相应计算节点进行支持度统计,从而以较为简单的算法实现频繁项目集并行挖掘.实验表明,NPFP_Growth算法具有较高的计算效率和良好的可伸缩性.%Mining of frequent item sets is used to find the association rules between items.In order to get frequent item sets of big data efficiently,this paper proposes a new parallel algorithm for mining frequent item sets based on FP_ Growth,named NPFP_Growth(New Parallel algorithm based on FP_Growth).The storage structure of local frequent pat tern tree is improved and created in each node based on parallel computing model Map/Reduce and distributed storage system HDFS,and then longest global frequent item sets are mined in each branch of the tree.Finally,Support for item sets which does not meet global minimum support is computed and then sent to corresponding computing node to count.Parallel mining algorithm NPFP_Growth is implemented.The experimental results show that the algorithm have high computing efficiency and good scalability.

著录项

来源
《南京师大学报（自然科学版）》|2016年第4期|19-24|共6页
作者
孙鸿艳; 吉根林;
展开▼
作者单位

南京师范大学计算机科学与技术学院,江苏南京210023;

南京师范大学计算机科学与技术学院,江苏南京210023;

展开▼
原文格式 PDF
正文语种 chi
中图分类 TP311.11;
关键词
频繁项目集; 关联规则; FP_Growth; Hadoop; Map/Reduce;

相似文献

中文文献
外文文献
专利

1. 一种基于FP-Growth的频繁项目集并行挖掘算法 [J] . 章志刚 ,吉根林 . 计算机工程与应用 . 2014,第002期
2. 一种新的最大频繁项目集挖掘算法 [J] . 马丽生 ,邓辉文 ,齐逸 . 计算机应用 . 2006,第011期
3. 一种基于频繁模式树的最大频繁项目集挖掘算法 [J] . 任永功 ,张亮 ,付玉 . 小型微型计算机系统 . 2010,第002期
4. 一种基于FP-tree的最大频繁项目集挖掘算法 [J] . 梅俊 ,郑刚 . 现代计算机（专业版） . 2009,第009期
5. 一种基于集合的频繁项目集挖掘算法 [J] . 苏海英 . 广东石油化工学院学报 . 2008,第004期
6. 一种改进的基于关联图的频繁项目集挖掘算法 [C] . 曹馨宇 ,李玉忱 ,任磊 . 2005中国计算机大会 . 2005
7. 基于Spark的并行FP_Growth算法的优化方法研究 [A] . 张欣 . 2018

一种新的基于FP_Growth的频繁项目集并行挖掘算法

摘要

著录项

相似文献

相关主题

期刊订阅