首页> 外文会议>International Conference on Advanced Computational Intelligence >MapReduce-based efficient algorithm for finding large patterns
【24h】

MapReduce-based efficient algorithm for finding large patterns

机译:基于MapReduce的高​​效算法,用于查找大型模式

获取原文

摘要

Finding large patterns is an objective of computational intelligence and a key step in many data mining applications, in particular in big data applications, where the scalability of mining algorithms is a great issue. This paper proposes an efficient algorithm Pampas that takes full advantage of the MapReduce framework in addressing the scalability issue. The novelty lies in two aspects: Pampas is the first parallel algorithm that integrates a breadth-first search strategy with a vertical mining approach, and Pampas proposes to employ different vertical formats in combination to represent the data, which improves not only scalability but also efficiency. Extensive experimental results demonstrate that the proposed algorithm outperforms the existing algorithms and scales out well with respect to database size and cluster size.
机译:查找大模式是计算智能的目标,并且是许多数据挖掘应用程序中的关键步骤,尤其是在大数据应用程序中,在大数据应用程序中,挖掘算法的可伸缩性是一个大问题。本文提出了一种有效的算法Pampas,该算法充分利用了MapReduce框架来解决可伸缩性问题。新颖之处在于两个方面:Pampas是第一个将广度优先搜索策略与垂直挖掘方法相集成的并行算法,Pampas提出结合使用不同的垂直格式来表示数据,这不仅提高了可伸缩性,而且提高了效率。大量的实验结果表明,该算法优于现有算法,并且在数据库大小和集群大小方面都可以很好地扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号