【24h】

A Parallel Joinless Algorithm for Co-location Pattern Mining Based on Group-Dependent Shard

机译:基于群相关分片的并行无连接共处模式挖掘算法

获取原文

摘要

Spatial co-location patterns, whose instances are frequently located together in geography, are particularly valuable for discovering spatial dependencies. Since its inception, lots of co-location pattern mining algorithms have been developed, but the computational cost remains prohibitively expensive with large data size. In this work, we propose to parallelize joinless algorithm on MapReduce framework. Our approach partitions computation in such a way that each machine independently executes joinless algorithm to finish a group of mining tasks. Such partitioning eliminates computational dependencies and reduces communication cost between machines. Moreover, a novel pruning technique is suggested to improve mining performance. The experimental results on synthetic and real-world data sets show that the parallel joinless algorithm is efficient and scalable.
机译:空间共同定位模式(其实例在地理区域中经常位于一起)对于发现空间依存关系特别有价值。自从它诞生以来,已经开发了许多共置模式挖掘算法,但是在大数据量的情况下,计算成本仍然高得令人望而却步。在这项工作中,我们建议在MapReduce框架上并行化无连接算法。我们的方法以一种方式对计算进行分区,以使每台机器独立执行无连接算法来完成一组挖掘任务。这种划分消除了计算依赖性,并减少了机器之间的通信成本。此外,提出了一种新颖的修剪技术以改善采矿性能。在综合和真实数据集上的实验结果表明,并行无连接算法是有效且可扩展的。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号