首页> 外文期刊>IEICE transactions on information and systems >Pivot Generation Algorithm with a Complete Binary Tree for Efficient Exact Similarity Search
【24h】

Pivot Generation Algorithm with a Complete Binary Tree for Efficient Exact Similarity Search

机译:具有完整二叉树的数据透视生成算法,可进行精确的精确相似搜索

获取原文
获取外文期刊封面目录资料

摘要

This paper presents a pivot-set generation algorithm for accelerating exact similarity search in a large-scale data set. To deal with the large-scale data set, it is important to efficiently construct a search index offline as well as to perform fast exact similarity search online. Our proposed algorithm efficiently generates competent pivots with two novel techniques: hierarchical data partitioning and fast pivot optimization techniques. To make effective use of a small number of pivots, the former recursively partitions a data set into two subsets with the same size depending on the rank order from each of two assigned pivots, resulting in a complete binary tree. The latter calculates a defined objective function for pivot optimization with a low computational cost by skillfully operating data objects mapped into a pivot space . Since the generated pivots provide the tight lower bounds on distances between a query object and the data objects, an exact similarity search algorithm effectively avoids unnecessary distance calculations. We demonstrate that the search algorithm using the pivots generated by the proposed algorithm reduces distance calculations with an extremely high rate regarding a range query problem for real large-scale image data sets.
机译:本文提出了一种枢轴集生成算法,用于加速大规模数据集中的精确相似性搜索。为了处理大规模数据集,重要的是有效地离线构建搜索索引以及在线快速执行精确相似搜索。我们提出的算法可通过两种新颖的技术有效地生成主管枢轴:分层数据划分和快速枢纽优化技术。为了有效利用少量枢轴,前者将数据集递归地划分为两个子集,该子集具有相同的大小,具体取决于两个已分配枢轴中每个枢轴的排名,从而形成一个完整的二叉树。后者通过熟练地操作映射到枢轴空间的数据对象,以较低的计算成本为枢轴优化计算定义的目标函数。由于生成的数据透视表提供了查询对象与数据对象之间距离的严格下限,因此精确的相似度搜索算法可有效避免不必要的距离计算。我们证明了使用由所提出的算法生成的支点的搜索算法,对于真实的大规模图像数据集的范围查询问题,以极高的速率减少了距离计算。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号