首页> 外文会议>Australasian Database Conference >Experimental Analysis of Locality Sensitive Hashing Techniques for High-Dimensional Approximate Nearest Neighbor Searches
【24h】

Experimental Analysis of Locality Sensitive Hashing Techniques for High-Dimensional Approximate Nearest Neighbor Searches

机译:高维近似邻邻搜索的地方敏感散列技术的实验分析

获取原文

摘要

Finding nearest neighbors in high-dimensional spaces is a fundamental operation in many multimedia retrieval applications. Exact tree-based approaches are known to suffer from the notorious curse of dimensionality for high-dimensional data. Approximate searching techniques sacrifice some accuracy while returning good enough results for faster performance. Locality Sensitive Hashing (LSH) is a popular technique for finding approximate nearest neighbors. There are two main benefits of LSH techniques: they provide theoretical guarantees on the query results, and they are highly scalable. The most dominant costs for existing external memory-based LSH techniques are algorithm time and index I/Os required to find candidate points. Existing works do not compare both of these costs in their evaluation. In this experimental survey paper, we show the impact of both these costs on the overall performance. We compare three state-of-the-art techniques on six real-world datasets, and show the importance of comparing these costs to achieve a more fair comparison.
机译:在高维空间中找到最近的邻居是许多多媒体检索应用中的基本操作。已知精确的基于树的方法遭受高维数据的臭名昭着的维度诅咒。近似搜索技术牺牲了一些准确性,同时返回足够好的结果以获得更快的性能。地区敏感散列(LSH)是一种用于查找近似邻居的流行技术。 LSH技术有两个主要好处:它们提供了对查询结果的理论保证,它们是高度可扩展的。基于外部存储器的LSH技术的最主导成本是查找候选点所需的算法时间和索引I / O.现有的作品不会在评估中比较这些成本。在该实验调查纸中,我们展示了这两种成本对整体性能的影响。我们比较六个现实世界数据集的三种最先进的技术,并显示了比较这些成本实现更公平比较的重要性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号