首页> 外文会议>International conference on management of data >Flexible Aggregate Similarity Search
【24h】

Flexible Aggregate Similarity Search

机译:灵活的聚合相似性搜索

获取原文

摘要

Aggregate similarity search, a.k.a. aggregate nearest neighbor (Ann) query, finds many useful applications in spatial and multimedia databases. Given a group Q of M query objects, it retrieves the most (or top-k) similar object to Q from a database P, where the similarity is an aggregation (e.g., sum, max) of the distances between the retrieved object p and all the objects in Q. In this paper, we propose an added flexibility to the emery definition, where the similarity is an aggregation over the distances between p and any subset of ΦM objects in Q for some support 0 < Φ> ≤ 1. We call this new definition flexible aggregate similarity (Fann) search, which generalizes the Ann problem. Next, we present algorithms for answering Fann queries exactly and approximately. Our approximation algorithms are (specially appealing, which are simple, highly efficient, and work well in both low and high dimensions. They also return near-optimal answers with guaranteed constant-factor approximations in any dimensions. Extensive experiments on large real and synthetic datasets from 2 to 74 dimensions have demonstrated their superior efficiency and high quality.
机译:聚合相似性搜索A.k.a.聚合最近邻(ANN)查询,在空间和多媒体数据库中查找许多有用的应用程序。给定M个查询对象的Q,它从数据库P检索到Q的最常(或TOP-K),其中相似度是检索到的对象P和的距离的聚合(例如,总和,最大值) Q中的所有对象。在本文中,我们提出了对Emery定义的额外灵活性,其中相似性是在P和某些支持中的QφM对象之间的距离上的聚合0 <φ>≤1.我们调用此新定义灵活的聚合相似性(FANN)搜索,概括了ANN问题。接下来,我们提供了用于究竟且大约且大致回答FANN查询的算法。我们的近似算法(特别吸引力,这是简单,高效的,并且在低维度和高维度上运行良好。它们还返回近乎最佳答案,在任何维度中具有保证的恒因子近似。大量实际和合成数据集从2到74维度展示了它们的效率优良和高质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号