首页> 外文会议>International Conference on Data Warehousing and Knowledge Discovery >Nearest Neighbor Search on Vertically Partitioned High-Dimensional Data
【24h】

Nearest Neighbor Search on Vertically Partitioned High-Dimensional Data

机译:最近的邻居搜索垂直分区的高维数据

获取原文

摘要

In this paper, we present a new approach to indexing multidimensional data that is particularly suitable for the efficient incremental processing of nearest neighbor queries. The basic idea is to use index-striping that vertically splits the data space into multiple low- and medium-dimensional data spaces. The data from each of these lower-dimensional subspaces is organized by using a standard multi-dimensional index structure. In order to perform incremental NN-queries on top of index-striping efficiently, we first develop an algorithm for merging the results received from the underlying indexes. Then, an accurate cost model relying on a power law is presented that determines an appropriate number of indexes. Moreover, we consider the problem of dimension assignment, where each dimension is assigned to a lower-dimensional sub-space, such that the cost of nearest neighbor queries is minimized. Our experiments confirm the validity of our cost model and evaluate the performance of our approach.
机译:在本文中,我们提出了一种索引多维数据的新方法,特别适用于最近邻查询的有效增量处理。基本思想是使用垂直地将数据空间分成多个低维和中维数据空间的索引条带。通过使用标准的多维索引结构来组织来自这些下维子空间中的每一个的数据。为了有效地在索引条带上执行增量NN查询,我们首先开发一种合并从底层索引接收的结果的算法。然后,提出了一种依赖于权力法的准确成本模型,其确定适当数量的索引。此外,我们考虑维度分配的问题,其中每个维度被分配给低维子空间,使得最近邻权查询的成本最小化。我们的实验证实了我们的成本模式的有效性,并评估了我们的方法的表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号