【24h】

Hypersphere Indexer

机译:超球索引器

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Indexing high-dimensional data for efficient nearest-neighbor searches poses interesting research challenges. It is well known that when data dimension is high, the search time can exceed the time required for performing a linear scan on the entire dataset. To alleviate this dimensionality curse, indexing schemes such as locality sensitive hashing (LSH) and M-trees were proposed to perform approximate searches. In this paper, we propose a hypersphere in-dexer, named Hydex, to perform such searches. Hydex partitions the data space using concentric hyperspheres. By exploiting geometric properties, Hydex can perform effective pruning. Our empirical study shows that Hydex enjoys three advantages over competing schemes for achieving the same level of search accuracy. First, Hydex requires fewer seek operations. Second, Hydex can maintain sequential disk accesses most of the time. And third, it requires fewer distance computations.
机译:为高效的最近邻居搜索索引高维数据提出了有趣的研究挑战。众所周知,当数据维数较高时,搜索时间可能会超过对整个数据集执行线性扫描所需的时间。为了减轻这种维数诅咒,提出了诸如局部敏感散列(LSH)和M树之类的索引方案来执行近似搜索。在本文中,我们提出了一种名为Hydex的超球体索引器来执行此类搜索。 Hydex使用同心超球来划分数据空间。通过利用几何特性,Hydex可以执行有效的修剪。我们的实证研究表明,Hydex在获得相同水平的搜索准确性方面比竞争方案具有三个优势。首先,Hydex需要更少的查找操作。其次,Hydex可以在大多数时间维持顺序的磁盘访问。第三,它需要更少的距离计算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号