首页> 外文会议>International Workshop on Similarity Research and Applications >Approximate Direct and Reverse Nearest Neighbor Queries, and the k-nearest Neighbor Graph
【24h】

Approximate Direct and Reverse Nearest Neighbor Queries, and the k-nearest Neighbor Graph

机译:近似直接和反向最近的邻查询,以及k最近邻图

获取原文

摘要

Retrieving the emph{k-nearest neighbors} of a query object is a basic primitive in similarity searching. A related, far less explored primitive is to obtain the dataset elements which would have the query object within their own emph{k}-nearest neighbors, known as the emph{reverse k-nearest neighbor} query. We already have indices and algorithms to solve emph{k}-nearest neighbors queries in general metric spaces; yet, in many cases of practical interest they degenerate to sequential scanning. The naive algorithm for reverse emph{k}-nearest neighbor queries has quadratic complexity, because the emph{k}-nearest neighbors of all the dataset objects must be found; this is too expensive. Hence, when solving these primitives we can tolerate trading correctness in the solution for searching time. In this paper we propose an efficient approximate approach to solve these similarity queries with high retrieval rate. Then, we show how to use our approximate emph{k}-nearest neighbor queries to construct (an approximation of) the emph{k-nearest neighbor graph} when we have a fixed dataset. Finally, combining both primitives we show how to emph{dynamically maintain} the approximate emph{k}-nearest neighbor graph of the objects currently stored within the metric dataset, that is, considering both object insertions and deletions.
机译:检索查询对象的EMPH {k-reallibor}是相似性搜索中的基本原语。一个相关的,远更少于探索的原始是要获得数据集元素,该数据集元素将在其自己的Emph {k} - 最终邻居内具有查询对象,称为EMPH {反向k最近邻居}查询。我们已经有指数和算法来解决常规度量空间中的Emph {k} - 最终邻居查询;然而,在许多实际兴趣的情况下,他们退化为连续扫描。反向Emph {k} -Nealest邻居查询的天真算法具有二次复杂性,因为必须找到所有数据集对象的Emph {k} - 最早邻居;这太贵了。因此,当解决这些基元时,我们可以容忍解决方案中的交易正确性进行搜索时间。在本文中,我们提出了一种有效的近似方法来解决具有高检索率的这些相似性查询。然后,我们展示如何使用我们的近似弹性{k} - 最邻居查询来构建(近似)EMPH {k最近邻图}时,我们有一个固定的数据集。最后,组合两个基元我们展示了如何表明当前存储在度量数据集中当前存储在度量数据集中的对象的近似Emph {k}的邻居图,即考虑对象插入和删除。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号