MapReduce-based efficient algorithm for finding large patterns

机译：基于MapReduce的高效算法，用于查找大型模式

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Finding large patterns is an objective of computational intelligence and a key step in many data mining applications, in particular in big data applications, where the scalability of mining algorithms is a great issue. This paper proposes an efficient algorithm Pampas that takes full advantage of the MapReduce framework in addressing the scalability issue. The novelty lies in two aspects: Pampas is the first parallel algorithm that integrates a breadth-first search strategy with a vertical mining approach, and Pampas proposes to employ different vertical formats in combination to represent the data, which improves not only scalability but also efficiency. Extensive experimental results demonstrate that the proposed algorithm outperforms the existing algorithms and scales out well with respect to database size and cluster size.

机译：查找大模式是计算智能的目标，并且是许多数据挖掘应用程序中的关键步骤，尤其是在大数据应用程序中，在大数据应用程序中，挖掘算法的可伸缩性是一个大问题。本文提出了一种有效的算法Pampas，该算法充分利用了MapReduce框架来解决可伸缩性问题。新颖之处在于两个方面：Pampas是第一个将广度优先搜索策略与垂直挖掘方法相集成的并行算法，Pampas提出结合使用不同的垂直格式来表示数据，这不仅提高了可伸缩性，而且提高了效率。大量的实验结果表明，该算法优于现有算法，并且在数据库大小和集群大小方面都可以很好地扩展。

著录项

来源
《International Conference on Advanced Computational Intelligence》|2015年|164-169|共6页
会议地点
作者
Liu Junqiang; Wu Yongsheng; Xu Shijian; Zhou Qingfeng; Xu Mengtao;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Efficient Algorithms for Finding All Length-Maximal Flock Patterns from a Set of Trajectories [J] . Hiroki Arimura, Xiaoliang Geng, Takeaki Uno 電子情報通信学会技術研究報告. コンピュテ-ション. Theoretical Foundations of Computing . 2013,第50期

机译：从一组轨迹中查找所有长度最大的羊群图案的高效算法
2. An efficient algorithm for finding frequent and diverse subgraph patterns in PPI networks [J] . Ma Wubin, Liu Mingxing, Huang Hongbin, International Journal of Sensor Networks . 2014,第4期

机译：在PPI网络中查找频繁且多样的子图模式的有效算法
3. An Efficient MapReduce-Based Parallel Clustering Algorithm for Distributed Traffic Subarea Division [J] . DawenXia, BinfengWang, YantaoLi, Discrete dynamics in nature and society . 2015,第2期

机译：基于MapReduce的高效分布式交通分区划分并行聚类算法
4. MapReduce-based efficient algorithm for finding large patterns [C] . Liu Junqiang, Wu Yongsheng, Xu Shijian, International Conference on Advanced Computational Intelligence . 2015

机译：基于Mapreduce的高效算法查找大型图案
5. Finding patterns in music data files with data mining algorithms [D] . Akhtar, Amin 2014

机译：使用数据挖掘算法在音乐数据文件中查找模式
6. Teaching Ordinal Patterns to a Computer: Efficient Encoding Algorithms Based on the Lehmer Code [O] . Sebastian Berger, Andrii Kravtsiv, Gerhard Schneider, 2019

机译：教导序数模式到计算机：基于Lehmer代码的高效编码算法
7. An Efficient MapReduce-Based Parallel Clustering Algorithm for Distributed Traffic Subarea Division [O] . Dawen Xia, Binfeng Wang, Yantao Li, 2015

机译：基于MapReduce的分布式流量子区域划分的并行聚类算法

MapReduce-based efficient algorithm for finding large patterns

摘要

著录项

相似文献

相关主题

期刊订阅