首页> 外文OA文献 >PaMPa-HD: A Parallel MapReduce-Based Frequent Pattern Miner for High-Dimensional Data
【2h】

PaMPa-HD: A Parallel MapReduce-Based Frequent Pattern Miner for High-Dimensional Data

机译:PaMPa-​​HD:基于并行MapReduce的高​​维数据频繁模式挖掘器

摘要

Frequent closed itemset mining is among the most complex exploratory techniques in data mining, and provides the ability to discover hidden correlations in transactional datasets. The explosion of Big Data is leading to new parallel and distributed approaches. Unfortunately, most of them are designed to cope with low-dimensional datasets, whereas no distributed high-dimensional frequent closed itemset mining algorithms exists. This work introduces PaMPa-HD, a parallel MapReduce-based frequent closed itemset mining algorithm for high-dimensional datasets, based on Carpenter. The experimental results, performed on both real and synthetic datasets, show the efficiency and scalability of PaMPa-HD
机译:频繁的封闭项集挖掘是数据挖掘中最复杂的探索技术之一,并且提供了发现事务数据集中隐藏的关联的能力。大数据的爆炸导致了新的并行和分布式方法。不幸的是,它们中的大多数被设计来处理低维数据集,而没有分布式高维频繁封闭项集挖掘算法存在。这项工作介绍了PaMPa-​​HD,这是一种基于Carpenter的基于并行MapReduce的针对高维数据集的频繁封闭项目集挖掘算法。在真实和合成数据集上进行的实验结果表明,PaMPa-​​HD的效率和可扩展性

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号