【24h】

Parallel Algorithms for High-dimensional Proximity Joins

机译:高维邻近联接的并行算法

获取原文
获取原文并翻译 | 示例

摘要

We consider the problem of parallelizing high-dimensional proximity joins. We present a parallel multidimensional join algorithm based on an the epsilon-kdB tree and compare it with the more common approach of space partitioning. An evaluation of the algorithms on an IBM SP2 shared-nothing multiprocessor is presented using both synthetic and real-life datasets. We also examine the effectiveness of the algorithms in the context of a specific data-mining problem, that of finding similar time-series. The empirical results show that our algorithm exhibits good performance and scalability, as well an ability to handle data-skew.
机译:我们考虑并行化高维接近连接的问题。我们提出了一种基于epsilon-kdB树的并行多维连接算法,并将其与更常见的空间分区方法进行了比较。同时使用综合和实际数据集对IBM SP2无共享多处理器上的算法进行了评估。我们还研究了在特定数据挖掘问题(查找相似时间序列)的情况下算法的有效性。实验结果表明,我们的算法表现出良好的性能和可伸缩性,以及处理数据偏斜的能力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号