首页> 外文期刊>Information Systems >Cost-aware query planning for similarity search
【24h】

Cost-aware query planning for similarity search

机译:用于相似性搜索的成本意识查询计划

获取原文
获取原文并翻译 | 示例
       

摘要

Similarity search aims to find all objects similar to a query object. Typically, some base similarity measures for the different properties of the objects are defined, and lightweight similarity indexes for these measures are built. A query plan specifies which similarity indexes to use with which similarity thresholds and how to combine the results. Previous work creates only a single, static query plan to be used by all queries. In contrast, our approach creates a new plan for each query. We introduce the novel problem of query planning for similarity search, i.e., selecting for each query the plan that maximizes completeness of the results with cost below a query-specific limit. By regarding the frequencies of attribute values we are able to better estimate plan completeness and cost, and thus to better distribute our similarity comparisons. Evaluation on a large real-world dataset shows that our approach significantly reduces cost variance and increases overall result completeness compared to static query plans.
机译:相似性搜索旨在查找与查询对象相似的所有对象。通常,为对象的不同属性定义一些基本的相似性度量,并为这些度量建立轻量级的相似性索引。查询计划指定要使用哪些相似性索引和哪些相似性阈值,以及如何组合结果。先前的工作仅创建一个静态查询计划,供所有查询使用。相反,我们的方法为每个查询创建了一个新计划。我们介绍了用于相似性搜索的查询计划的新问题,即为每个查询选择在成本低于特定查询限制的情况下最大化结果完整性的计划。通过考虑属性值的频率,我们能够更好地估计计划的完整性和成本,从而更好地分布我们的相似度比较。对大型现实数据集的评估表明,与静态查询计划相比,我们的方法显着减少了成本差异并提高了整体结果的完整性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号