Journal: Knowledge and Data Engineering, IEEE Transactions on

Scalable Distributed Processing of K Nearest Neighbor Queries over Moving Objects


Abstract

Central to many applications involving moving objects is the task of processing k-nearest neighbor (k-NN) queries. Most of the existing approaches to this problem are designed for the centralized setting where query processing takes place on a single server; it is difficult, if not impossible, for them to scale to a distributed setting to handle the vast volume of data and concurrent queries that are increasingly common in those applications. To address this problem, we propose a suite of solutions that can support scalable distributed processing of k-NN queries. We first present a new index structure called Dynamic Strip Index (DSI), which can better adapt to different data distributions than existing grid indexes. Moreover, it can be naturally distributed across the cluster, therefore lending itself well to distributed processing. We further propose a distributed k-NN search (DKNN) algorithm based on DSI. DKNN avoids having an uncertain number of potentially expensive iterations, and is thus more efficient and more predictable than existing approaches. DSI and DKNN are implemented on Apache S4, an open-source platform for distributed stream processing. We perform extensive experiments to study the characteristics of DSI and DKNN, and compare them with three baseline methods. Experimental results show that our proposal scales well and significantly outperforms the alternative methods.
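
The abstract does not spell out how DSI or DKNN work internally, so the following is only a rough single-machine sketch of the general idea of a strip-based index with a fixed two-step k-NN search. It is not the paper's implementation. The class name `StripIndex`, the `capacity` split threshold, the median-split rule, and the method names are all assumptions made for illustration, and distribution across a cluster (for example on Apache S4) is left out entirely.

```python
import bisect
import heapq
import math


class StripIndex:
    """Toy strip index (hypothetical): vertical strips along x that split when full."""

    def __init__(self, x_min, x_max, capacity=4):
        self.bounds = [x_min, x_max]  # strip i covers [bounds[i], bounds[i + 1])
        self.strips = [[]]            # list of (x, y) points per strip
        self.capacity = capacity      # assumed split threshold, not from the paper

    def _strip_of(self, x):
        # Locate the strip containing x, clamping out-of-range values to the edges.
        i = bisect.bisect_right(self.bounds, x) - 1
        return min(max(i, 0), len(self.strips) - 1)

    def insert(self, p):
        i = self._strip_of(p[0])
        self.strips[i].append(p)
        if len(self.strips[i]) > self.capacity:
            self._split(i)

    def _split(self, i):
        # Cut an overfull strip at its median x so dense regions get narrower strips.
        xs = sorted(p[0] for p in self.strips[i])
        mid = xs[len(xs) // 2]
        if not (self.bounds[i] < mid < self.bounds[i + 1]):
            return  # no valid cut position (e.g. many points share one x)
        left = [p for p in self.strips[i] if p[0] < mid]
        right = [p for p in self.strips[i] if p[0] >= mid]
        self.strips[i:i + 1] = [left, right]
        self.bounds.insert(i + 1, mid)

    def knn(self, q, k):
        def dist(p):
            return math.hypot(p[0] - q[0], p[1] - q[1])

        # Step 1: widen a window of strips around q until it holds at least k points.
        lo = hi = self._strip_of(q[0])
        count = len(self.strips[lo])
        while count < k and (lo > 0 or hi < len(self.strips) - 1):
            if lo > 0:
                lo -= 1
                count += len(self.strips[lo])
            if hi < len(self.strips) - 1:
                hi += 1
                count += len(self.strips[hi])
        cand = [p for s in self.strips[lo:hi + 1] for p in s]
        if len(cand) < k:
            return sorted(cand, key=dist)  # the index holds fewer than k points

        # The k-th candidate distance is an upper bound on the true k-th distance.
        r = heapq.nsmallest(k, (dist(p) for p in cand))[-1]

        # Step 2: one scan over every strip overlapping [qx - r, qx + r].
        first = self._strip_of(q[0] - r)
        last = self._strip_of(q[0] + r)
        pool = [p for s in self.strips[first:last + 1] for p in s]
        return heapq.nsmallest(k, pool, key=dist)
```

In this sketch the search always finishes in two steps: a bounding radius is derived from nearby strips, then every strip the resulting circle can touch is scanned once. This is meant to illustrate, under the stated assumptions, how a search can avoid an open-ended number of expansion rounds.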
