首页> 外文会议>International Workshop on Deterministic and Statistical Methods in Machine Learning >Redundant Bit Vectors for Quickly Searching High-Dimensional Regions
【24h】

Redundant Bit Vectors for Quickly Searching High-Dimensional Regions

机译:冗余位向量,用于快速搜索高维地区

获取原文

摘要

Applications such as audio fingerprinting require search in high dimensions: find an item in a database that is similar to a query. An important property of this search task is that negative answers are very frequent: much of the time, a query does not correspond to any database item. We propose Redundant Bit Vectors (RBVs): a novel method for quickly solving this search problem. RBVs rely on three key ideas: 1) approximate the high-dimensional regions/distributions as tightened hyperrectangles, 2) partition the query space to store each item redundantly in an index and 3) use bit vectors to store and search the index efficiently. We show that our method is the preferred method for very large databases or when the queries are often not in the database. Our method is 109 times faster than linear scan, and 48 times faster than locality-sensitive hashing on a data set of 239369 audio fingerprints.
机译:音频指纹识别的应用程序需要高维搜索:在类似于查询的数据库中找到一个项目。此搜索任务的一个重要属性是负答案非常频繁:大部短时间,查询与任何数据库项都不对应。我们提出了冗余位矢量(RBV):一种快速解决此搜索问题的新方法。 RBVS依赖于三个关键的想法:1)近似高维地区/分布为紧固超直角,2)分区查询空间在索引中冗余地存储每个项目,3)使用比特向量有效地存储和搜索索引。我们表明我们的方法是非常大的数据库的首选方法,或者查询通常不在数据库中。我们的方法比线性扫描快109倍,并且在239369音频指纹的数据集上的位置敏感散列速度快48倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号