Journal: Knowledge and Information Systems

Parallel data mining for association rules on shared-memory systems



Abstract

In this paper we present a new parallel algorithm for mining association rules on shared-memory multiprocessors. We study the degree of parallelism, synchronization, and data locality issues, and present optimizations for fast frequency computation. Experiments show that the proposed optimizations improve performance significantly, and that the parallel algorithm achieves good speed-up.

Many data-mining tasks (e.g. association rules, sequential patterns) use complex pointer-based data structures (e.g. hash trees) that typically suffer from suboptimal data locality. In the multiprocessor case, shared access to these data structures may also result in false sharing. For these tasks it is commonly observed that the recursive data structure is built once and accessed multiple times during each iteration. Furthermore, the access patterns after the build phase are highly ordered. In such cases, memory placement of these structures that is sensitive to locality and false sharing can enhance performance significantly. We evaluate a set of placement policies for parallel association discovery, and show that simple placement schemes can improve execution time by more than a factor of two. More complex schemes yield additional gains.
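To make the frequency-computation step concrete, the following is a minimal sketch of parallel support counting on a shared-memory machine. It is not the algorithm from the paper: it replaces the hash tree with a brute-force scan over the candidates, and every name in it (`Itemset`, `contains`, `count_support`) is hypothetical. What it illustrates is one common way to limit synchronization and contention on shared counters: each thread counts into a private array and the partial counts are summed once at the end.

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <thread>
#include <utility>
#include <vector>

// Hypothetical type for illustration: an itemset is a sorted vector of item ids.
using Itemset = std::vector<int>;

// True if every item of `candidate` occurs in `transaction` (both must be sorted).
bool contains(const Itemset& transaction, const Itemset& candidate) {
    return std::includes(transaction.begin(), transaction.end(),
                         candidate.begin(), candidate.end());
}

// Counts, for each candidate, the number of transactions that contain it.
// Assumption: a linear scan over candidates stands in for the hash tree
// lookup described in the abstract.
std::vector<long> count_support(const std::vector<Itemset>& transactions,
                                const std::vector<Itemset>& candidates,
                                std::size_t num_threads) {
    assert(num_threads > 0);
    std::vector<std::vector<long>> partial(num_threads);
    std::vector<std::thread> workers;
    const std::size_t n = transactions.size();

    for (std::size_t t = 0; t < num_threads; ++t) {
        workers.emplace_back([&, t] {
            // Each thread takes a contiguous block of transactions.
            const std::size_t begin = n * t / num_threads;
            const std::size_t end = n * (t + 1) / num_threads;
            // Thread-private counters: the hot loop never writes to a
            // counter array shared with other threads.
            std::vector<long> local(candidates.size(), 0);
            for (std::size_t i = begin; i < end; ++i)
                for (std::size_t c = 0; c < candidates.size(); ++c)
                    if (contains(transactions[i], candidates[c])) ++local[c];
            partial[t] = std::move(local);  // one shared write per thread
        });
    }
    for (auto& w : workers) w.join();

    // Sequential reduction of the per-thread partial counts.
    std::vector<long> total(candidates.size(), 0);
    for (const auto& p : partial)
        for (std::size_t c = 0; c < p.size(); ++c) total[c] += p[c];
    return total;
}
```

Calling `count_support(transactions, candidates, 4)` returns one count per candidate, in candidate order. Private counters trade extra memory (one array per thread) for the absence of locks and atomic updates in the inner loop; the placement policies the abstract evaluates address the complementary problem of where the shared, read-mostly structure itself sits in memory.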
