【24h】

Enhancing SpatialHadoop with Closest Pair Queries

机译:通过最近对查询增强SpatialHadoop

获取原文

摘要

Given two datasets P and Q, the K Closest Pair Query (KCPQ) finds the K closest pairs of objects from P×Q. It is an operation widely adopted by many spatial and GIS applications. As a combination of the K Nearest Neighbor (KNN) and the spatial join queries, KCPQ is an expensive operation. Given the increasing volume of spatial data, it is difficult to perform a KCPQ on a centralized machine efficiently. For this reason, this paper addresses the problem of computing the KCPQ on big spatial datasets in SpatialHadoop, an extension of Hadoop that supports spatial operations efficiently, and proposes a novel algorithm in SpatialHadoop to perform efficient parallel KCPQ on large-scale spatial datasets. We have evaluated the performance of the algorithm in several situations with big synthetic and real-world datasets. The experiments have demonstrated the efficiency and scalability of our proposal.
机译:给定两个数据集P和Q,K最接近对查询(KCPQ)从P×Q中找到K最接近的对象对。它是许多空间和GIS应用程序广泛采用的一种操作。作为K最近邻(KNN)和空间联接查询的组合,KCPQ是一项昂贵的操作。鉴于空间数据量的不断增长,很难在​​集中式计算机上高效地执行KCPQ。因此,本文解决了在SpatialHadoop中对大型空间数据集进行KCPQ计算的问题,SpatialHadoop是有效支持空间操作的Hadoop扩展,并提出了一种在SpatialHadoop中对大型空间数据集执行高效并行KCPQ的新算法。我们已经在大型合成和真实数据集的几种情况下评估了该算法的性能。实验证明了我们建议的效率和可扩展性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号