首页> 外文会议>International Conference on Similarity Search and Applications >Confirmation Sampling for Exact Nearest Neighbor Search
【24h】

Confirmation Sampling for Exact Nearest Neighbor Search

机译:确认精确最近邻的搜索采样

获取原文

摘要

Locality-sensitive hashing (LSH), introduced by Indyk and Motwani in STOC '98, has been an extremely influential framework for nearest neighbor search in high-dimensional data sets. While theoretical work has focused on the approximate nearest neighbor problem, in practice LSH data structures with suitably chosen parameters are used to solve the exact nearest neighbor problem (with some error probability). Sublinear query time is often possible in practice even for exact nearest neighbor search, intuitively because the nearest neighbor tends to be significantly closer than other data points. However, theory offers little advice on how to choose LSH parameters outside of pre-specified worst-case settings. We introduce the technique of confirmation sampling for solving the exact nearest neighbor problem using LSH. First, we give a general reduction that transforms a sequence of data structures that each find the nearest neighbor with a small, unknown probability, into a data structure that returns the nearest neighbor with probability 1-δ, using as few queries as possible. Second, we present a new query algorithm for the LSH Forest data structure with L trees that is able to return the exact nearest neighbor of a query point within the same time bound as an LSH Forest of Ω(L) trees with internal parameters specifically tuned to the query and data.
机译:局部性敏感散列(LSH),通过达克和Motwani在STOC '98推出,一直在高维数据集最近邻搜索一个极具影响力的框架。虽然理论工作都集中在近似最邻近的问题,与适当选择的参数实践LSH数据结构被用来解决确切最近邻问题(有一些错误概率)。次线性查询时间往往是在实践中可能甚至精确近邻搜索,直观的,因为最近的邻居往往是显著更接近比其它数据点。然而,理论提供了关于如何预先规定的最坏情况设置外选择LSH参数有点意见。我们引进的确认采样解决使用LSH确切的近邻问题的技术。首先,我们给出一个普遍下降即转换数据结构的序列,每找到一个小的,未知的概率在最近的邻居,为数据结构与回报概率为1-δ近邻,使用尽可能少的查询尽可能。第二,我们提出了以L树木LSH森林数据结构,其能够结合作为Ω的LSH森林(L)与特别调谐的内部参数树相同的时间内返回的查询点的精确最近邻居的新查询算法对查询和数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号