...
首页> 外文期刊>Data & Knowledge Engineering >Algorithms for processing K-closest-pair queries in spatial databases
【24h】

Algorithms for processing K-closest-pair queries in spatial databases

机译:在空间数据库中处理K最接近对查询的算法

获取原文
获取原文并翻译 | 示例

摘要

This paper addresses the problem of finding the K closest pairs between two spatial datasets (the so-called, K closest pairs query, K-CPQ), where each dataset is stored in an R-tree. There are two different techniques for solving this kind of distance-based query. The first technique is the incremental approach, which returns the output elements one-by-one in ascending order of distance. The second one is the non-incremental alternative, which returns the K elements of the result all together at the end of the algorithm. In this paper, based on distance functions between two MBRs in the multidimensional Euclidean space, we propose a pruning heuristic and two updating strategies for minimizing the pruning distance, and use them in the design of three non-incremental branch-and-bound algorithms for K-CPQ between spatial objects stored in two R-trees. Two of those approaches are recursive following a Depth-First searching strategy and one is iterative obeying a Best-First traversal policy. The plane-sweep method and the search ordering are used as optimization techniques for improving the naive approaches. Besides, a number of interesting extensions of the K-CPQ (K-Self-CPQ, Semi-CPQ, K-FPQ (the K-farthest pairs query), etc.) are discussed. An extensive performance study is also presented. This study is based on experiments performed with real datasets. A wide range of values for the basic parameters affecting the performance of the algorithms is examined in order to designate the most efficient algorithm for each setting of parameter values. Finally, an experimental study of the behavior of the proposed K-CPQ branch-and-bound algorithms in terms of scalability of the dataset size and the K value is also included.
机译:本文解决了在两个空间数据集之间找到K个最接近的对的问题(所谓的K最接近的对查询,K-CPQ),其中每个数据集都存储在R树中。解决这种基于距离的查询有两种不同的技术。第一种技术是增量方法,它以距离的升序逐一返回输出元素。第二个是非增量选择,它在算法结束时一起返回结果的K个元素。本文基于多维欧几里得空间中两个MBR之间的距离函数,提出了一种修剪启发式方法和两种更新策略以最小化修剪距离,并将其用于三种非增量分支定界算法中。存储在两个R树中的空间对象之间的K-CPQ。这些方法中的两种是遵循深度优先搜索策略的递归方法,一种是遵循最佳优先遍历策略的迭代方法。平面扫描方法和搜索顺序被用作优化技术,以改进朴素的方法。此外,还讨论了K-CPQ的许多有趣扩展(K-Self-CPQ,Semi-CPQ,K-FPQ(K-最远对查询)等)。还介绍了广泛的性能研究。这项研究基于对真实数据集进行的实验。检查了影响算法性能的基本参数的各种值,以便为每种参数值设置指定最有效的算法。最后,还对所提出的K-CPQ分支定界算法在数据集大小和K值的可伸缩性方面的行为进行了实验研究。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号