Performance Evaluation of Apriori Algorithm on a Hadoop Cluster

机译：Hadoop集群对APRIORI算法的性能评估

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Frequent Itemset Mining is a well-known concept in data sciences. If we feed frequent itemset miner algorithms with large datasets they become resource hungry fast as their search space explodes. This problem is even more apparent when we try to use them on Big Data. Recent advances in parallel programming provides good solutions to deal with large datasets but they present their own problems when we try to modify existing data mining algorithms for the new paradigms. The Apriori-algorithm is a classic solution for mining frequent item-sets. In this paper, we provide a parallel implementation of the Apriori algorithm for the Hadoop platform. We introduce a method to measure the performance of the distributed algorithm. In our experimental results we find choke points in the algorithm and provide resolutions.

机译：频繁的项目集挖掘是数据科学中的众所周知的概念。如果我们使用大型数据集馈送频繁的项目集矿物算法，它们将在其搜索空间爆炸时快速成为饥饿的资源。当我们尝试在大数据上使用它们时，此问题更加明显。并行编程的最新进展提供了处理大型数据集的良好解决方案，但是当我们尝试修改新的范例的现有数据挖掘算法时，他们呈现自己的问题。 APRiori-Algorithm是挖掘频繁项目集的经典解决方案。在本文中，我们提供了Hadoop平台的APRiori算法的并行实现。我们介绍一种测量分布式算法性能的方法。在我们的实验结果中，我们在算法中发现了Choke点并提供了分辨率。

著录项

来源
《International Conference on Circuits, Systems, Control, Signals;International Conference on Acoustics, Speech and Audio Processing》|2014年||共8页
会议地点
作者
JANOS ILLES; ISTVAN VAJK;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP2-53;
关键词
Hadoop; MapReduce; Apriori-algorithm; Frequent itemset mining; Cloud computing;

机译：Hadoop;mapreduce;apriori算法;频繁的项目集挖掘;云计算;

相似文献

外文文献
中文文献
专利

1. Performance optimization of MapReduce-based Apriori algorithm on Hadoop cluster [J] . Singh Sudhakar, Garg Rakhi, Mishra P. K. Computers and Electrical Engineering . 2018,第期

机译：基于MapReduce的Apriori算法在Hadoop集群中的性能优化
2. PARALLEL IMPLEMENTATION OF APRIORI ALGORITHMS ON THE HADOOP-MAPREDUCE PLATFORM- AN EVALUATION OF LITERATURE [J] . A.L.SAYETH SAABITH, ELANKOVAN SUNDARARAJAN, AZURALIZA ABU BAKAR Journal of Theoretical and Applied Information Technology . 2016,第3期

机译：Hadoop-Mapreduce平台上先验算法的并行实现-文学评估
3. Improved FTWeightedHashT Apriori Algorithm for Big Data using Hadoop-MapReduce Model [J] . Sarem M. Ammar, Fadl M. Ba-Alwi British Journal of Mathematics & Computer Science . 2018,第1期

机译：使用Hadoop-MapReduce模型改进的大数据FTWeightedHashT先验算法
4. Performance Evaluation of Apriori Algorithm on a Hadoop Cluster [C] . JANOS ILLES, ISTVAN VAJK International Conference on Circuits, Systems, Control, Signals . 2014

机译：Hadoop集群对APRIORI算法的性能评估
5. High-performance cluster computing, algorithms, implementations and performance evaluation for computation-intensive applications to promote complex scientific research on turbulent flows. [D] . Wang, Hao. 2001

机译：面向计算密集型应用程序的高性能群集计算，算法，实现和性能评估，以促进对湍流的复杂科学研究。
6. The water supply association analysis method in Shenzhen based on kmeans clustering discretization and apriori algorithm [O] . Xin Liu, Xuefeng Sang, Jiaxuan Chang, 2021

机译：深圳供水协会分析方法基于Kmeans聚类离散化和APRiori算法
7. Performance Analysis of Apriori Algorithm with Different Data Structures on Hadoop Cluster [O] . Singh, Sudhakar, Garg, Rakhi, Mishra, P. K. 2015

机译：不同数据结构的apriori算法性能分析在Hadoop集群上

Performance Evaluation of Apriori Algorithm on a Hadoop Cluster

摘要

著录项

相似文献

相关主题

期刊订阅