【24h】

Query-sensitive embeddings

机译:查询敏感的嵌入

获取原文

摘要

A common problem in many types of databases is retrieving the most similar matches to a query object. Finding those matches in a large database can be too slow to be practical, especially in domains where objects are compared using computationally expensive similarity (or distance) measures. This paper proposes a novel method for approximate nearest neighbor retrieval in such spaces. Our method is embedding-based, meaning that it constructs a function that maps objects into a real vector space. The mapping preserves a large amount of the proximity structure of the original space, and it can be used to rapidly obtain a short list of likely matches to the query. The main novelty of our method is that it constructs, together with the embedding, a query-sensitive distance measure that should be used when measuring distances in the vector space. The term "query-sensitive" means that the distance measure changes depending on the current query object. We report experiments with an image database of handwritten digits, and a time-series database. In both cases, the proposed method outperforms existing state-of-the-art embedding methods, meaning that it provides significantly better trade-offs between efficiency and retrieval accuracy.
机译:在许多类型的数据库中,一个常见的问题是检索与查询对象最相似的匹配项。在大型数据库中找到这些匹配项可能太慢而无法实用,特别是在使用计算量大的相似性(或距离)度量比较对象的域中。本文提出了一种在此类空间中近似最近邻检索的新方法。我们的方法是基于嵌入的,这意味着它构造了一个将对象映射到真实向量空间的函数。映射保留了原始空间的大量邻近结构,可用于快速获取与查询可能匹配的简短列表。我们的方法的主要新颖之处在于,它与嵌入一起构造了一个查询敏感的距离度量,该度量在测量向量空间中的距离时应使用。术语“查询敏感”是指距离量度根据当前查询对象而变化。我们用手写数字图像数据库和时间序列数据库报告实验。在这两种情况下,提出的方法都优于现有的最新嵌入方法,这意味着它在效率和检索精度之间提供了更好的折衷方案。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号