首页> 外文期刊>Journal of combinatorial optimization >Enabling high-dimensional range queries using kNN indexing techniques: approaches and empirical results
【24h】

Enabling high-dimensional range queries using kNN indexing techniques: approaches and empirical results

机译:使用kNN索引技术实现高维范围查询:方法和经验结果

获取原文
获取原文并翻译 | 示例
           

摘要

Many modern search applications are high-dimensional and depend on efficient orthogonal range queries. These applications span web-based and scientific needs as well as uses for data mining. Although k-nearest neighbor queries are becoming increasingly common due to mobile and geospatial applications, orthogonal range queries in high-dimensional data remain extremely important and relevant. For efficient querying, data is typically stored in an index optimized for either kNN or range queries. This can be problematic when data is optimized for kNN retrieval and a user needs a range query or vice versa. Here, we address the issue of using a kNN-based index for range queries, as well as outline the general computational geometry problem of adapting these systems to range queries. We refer to these methods as space-based decompositions and provide a straightforward heuristic for this problem. Using iDistance as our applied kNN indexing technique, we also develop an optimal (data-based) algorithm designed specifically for its indexing scheme. We compare this method to the suggested na < ve approach using real world datasets. The data-based algorithm consistently performs better.
机译:许多现代搜索应用程序都是高维的,并且依赖于有效的正交范围查询。这些应用程序跨越了基于Web的科学需求以及数据挖掘的用途。尽管k近邻查询由于移动和地理空间应用而变得越来越普遍,但高维数据中的正交范围查询仍然非常重要和相关。为了有效查询,数据通常存储在针对kNN或范围查询优化的索引中。当针对kNN检索优化了数据并且用户需要范围查询或相反时,这可能会出现问题。在这里,我们解决了将基于kNN的索引用于范围查询的问题,并概述了使这些系统适应范围查询的一​​般计算几何问题。我们将这些方法称为基于空间的分解,并为该问题提供了一种直观的启发式方法。使用iDistance作为我们应用的kNN索引技术,我们还开发了专门为其索引方案设计的最佳(基于数据)算法。我们将这种方法与使用真实数据集的建议的朴素方法进行了比较。基于数据的算法始终表现更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号