An efficient FP-Growth based association rule mining algorithm using Hadoop MapReduce

A Senthilkumar; D Hari Prasad

首页> 外文期刊>Indian Journal of Science and Technology >An efficient FP-Growth based association rule mining algorithm using Hadoop MapReduce

【24h】

An efficient FP-Growth based association rule mining algorithm using Hadoop MapReduce

机译：一种高效的基于FP-生长的关联规则挖掘算法使用Hadoop MakReduce

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Objectives: To achieve improved performance of FP-Growth based Association Rule Mining algorithm for massive data by effective utilization of storage,execution capability and improved partition technique within the Hadoop MapReduce framework. Methodology: The proposed methodology has four main phases: In the first phase, the item sets for finding the frequent pattern are encoded and thus minimizes the expensive operation for large data set. In the second phase, improved hash partitioning reduces the network overhead and improves the communication speed within the MapReduce phase for each item set. The effective usage of network bandwidth and storage is obtained by the impact of compression in the third phase. The use of combiner in final phase for frequent item set mining minimizes the overhead of reduce phase by finding the pattern in each partition and minimizes the overall execution time of the FP-Growth algorithm. Findings: FP-Growth based association rule mining algorithm is designed for parallel execution on distributed cluster of servers. Changes to the MapReduce implementation of FP-Growth with the impact of encoding. Improved hash partitioning, compression and configuration results in a significant performance gain with better improvement in execution time.Novelty/Improvements: According to the experimental results, the changes in storage and processing level within the MapReduce framework improves the overall performance of the parallel frequent item set mining in Hadoop cluster.

机译：目标：通过有效利用Hadoop MapReduce框架内的存储，执行能力和改进的分区技术来实现基于FP-Growce Culity挖掘算法的改进性能。方法论：所提出的方法有四个主要阶段：在第一阶段，用于查找频繁模式的项目集被编码，从而最大限度地减少了大数据集的昂贵操作。在第二阶段中，改进的散列分区减少了网络开销，并提高了每个项目集的Mapreduce阶段内的通信速度。通过压缩在第三阶段的影响获得了网络带宽和存储的有效使用。在频繁项目集挖掘的最终阶段中使用组合器通过在每个分区中查找模式来最小化降低阶段的开销，并最大限度地减少FP-Grows算法的总执行时间。调查结果：FP-Growce基础的关联规则挖掘算法旨在用于Servered Server的分布式群集上的并行执行。通过编码的影响，对MAPREDUCE实施FP-Grower的实施。改进的散列分区，压缩和配置导致显着的性能增益，执行时间更好地提高.Novelty /改进：根据实验结果，MapReduce框架内的存储和处理级别的变化提高了并行频繁项目的整体性能在Hadoop集群中设置挖掘。

著录项

来源
《Indian Journal of Science and Technology》 |2020年第34期|共11页
作者
A Senthilkumar; D Hari Prasad;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类连续性出版物;
关键词
Association rule miningHadoopMapReduceFP-Growth;

机译：协会规则MininghadoopMapreduceFP-Grower;
入库时间 2022-08-19 01:53:18

相似文献

外文文献
中文文献
专利

1. An improvement of FP-Growth association rule mining algorithm based on adjacency table [J] . Ming Yin, Wenjie Wang, Yang Liu, MATEC Web of Conferences . 2018,第3期

机译：基于邻接表的FP-Growth关联规则挖掘算法的改进
2. Map-optimize-reduce: CAN tree assisted FP-growth algorithm for clusters based FP mining on Hadoop [J] . Ragaventhiran J., Kavithadevi M. K. Future generation computer systems . 2020,第Feba期

机译：Map-optimize-reduce：CAN树辅助的FP增长算法，用于基于集群的Hadoop上的FP挖掘
3. Positive and negative association rule mining in Hadoop’s MapReduce environment [J] . Sikha Bagui, Probal Chandra Dhar Journal of Big Data . 2019,第1期

机译：Hadoop MapReduce环境中的正负关联规则挖掘
4. Failure Part Mining Using an Association Rules Mining by FP-Growth and Apriori Algorithms: Case of ATM Maintenance in Thailand [C] . Nachirat Rachburee, Jedsada Arunrerk, Wattana Punlumjeak International Conference on IT Convergence and Security . 2017

机译：使用关联规则通过FP-Growth和Apriori算法进行故障零件挖掘：泰国ATM维护案例
5. Efficient sequential and parallel algorithms for mining association rules in text databases [D] . Holt, John D. 2003

机译：用于挖掘文本数据库中关联规则的高效顺序和并行算法
6. TSARM-UDP: An Efficient Time Series Association Rules Mining Algorithm Based on Up-to-Date Patterns [O] . Qiang Zhao, Qing Li, Deshui Yu, 2021

机译：TSARM-UDP：基于最新模式的有效时间序列关联规则挖掘算法
7. An efficient FP-Growth based association rule mining algorithm using Hadoop MapReduce [O] . A Senthilkumar 2020

机译：一种高效的基于FP-生长的关联规则挖掘算法使用Hadoop MakReduce

An efficient FP-Growth based association rule mining algorithm using Hadoop MapReduce

摘要

著录项

相似文献

相关主题

期刊订阅