首页> 外文期刊>Parallel Computing >Novel parallel method for association rule mining on multi-core shared memory systems
【24h】

Novel parallel method for association rule mining on multi-core shared memory systems

机译:多核共享内存系统上关联规则挖掘的新并行方法

获取原文
获取原文并翻译 | 示例

摘要

Association rule mining (ARM) is an important task in data mining with many practical applications. Current methods for association rule mining have shown unstable performance for different database types and under-utilize the benefits of multi-core shared memory machines. In this paper, we address these issues by presenting a novel parallel method for finding frequent patterns, the most computational intensive phase of ARM. Our proposed method, named ShaFEM, combines two mining strategies and applies the most appropriate one to each data subset of the database to efficiently adapt to the data characteristics and run fast on both sparse and dense databases. In addition, our new-lock-free design minimizes the synchronization needs and maximizes the data independence to enhance the scalability. The new structure lends itself well to dynamic job scheduling resulting in a well-balanced load on the new multi-core shared memory architectures. We have evaluated ShaFEM on 12-core multi-socket servers and found that our method run up to 5.8 times faster and consumes memory up to 7.1 times less than the state-of-the-art parallel method. For some test cases, ShaFEM can save up to 4.9 days of execution time over the compared method.
机译:在许多实际应用中,关联规则挖掘(ARM)是数据挖掘中的重要任务。当前用于关联规则挖掘的方法对于不同的数据库类型显示出不稳定的性能,并且未充分利用多核共享内存计算机的优势。在本文中,我们通过提出一种新颖的并行方法来查找频繁模式(ARM的计算量最大的阶段)来解决这些问题。我们提出的名为ShaFEM的方法结合了两种挖掘策略,并将最合适的一种应用于数据库的每个数据子集,以有效适应数据特征并在稀疏和密集数据库上快速运行。此外,我们的新无锁设计最大程度地减少了同步需求,并最大化了数据独立性,从而增强了可扩展性。新的结构非常适合动态作业调度,从而在新的多核共享内存体系结构上实现了均衡的负载。我们在12核多路服务器上评估了ShaFEM,发现我们的方法运行速度比最新的并行方法快5.8倍,消耗的内存少7.1倍。对于某些测试用例,与比较方法相比,ShaFEM最多可以节省4.9天的执行时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号