首页> 外文期刊>Information Systems >Fast similarity join for multi-dimensional data
【24h】

Fast similarity join for multi-dimensional data

机译:快速相似联接,用于多维数据

获取原文
获取原文并翻译 | 示例
       

摘要

The efficient processing of multidimensional similarity joins is important for a large class of applications. The dimensionality of the data for these applications ranges from low to high. Most existing methods have focused on the execution of high-dimensional joins over large amounts of disk-based data. The increasing sizes of main memory available on current computers, and the need for efficient processing of spatial joins suggest that spatial joins for a large class of problems can be processed in main memory. In this paper, we develop two new in-memory spatial join algorithms, the Grid-join and EGO~*-join, and study their performance. Through evaluation, we explore the domain of applicability of each approach and provide recommendations for the choice of a join algorithm depending upon the dimensionality of the data as well as the expected selectivity of the join. We show that the two new proposed join techniques substantially outperform the state-of-the-art join algorithm, the EGO-join.
机译:多维相似联接的有效处理对于一大类应用程序很重要。这些应用程序的数据维数范围从低到高。大多数现有方法都专注于对大量基于磁盘的数据执行高维联接。当前计算机上可用的主存储器大小不断增加,以及对空间联接的有效处理的需求表明,可以在主存储器中处理大量问题的空间联接。在本文中,我们开发了两种新的内存空间连接算法,即Grid-join和EGO〜* -join,并研究了它们的性能。通过评估,我们探索了每种方法的适用范围,并根据数据的维数以及连接的预期选择性为连接算法的选择提供了建议。我们表明,这两种新提出的连接技术大大优于最新的连接算法EGO-join。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号