【24h】

INDEXING IMAGES IN HIGH-DIMENSIONAL AND DYNAMIC-WEIGHTED FEATURE SPACES

机译:高维和动态加权特征空间中的索引图像

获取原文
获取原文并翻译 | 示例

摘要

As information retrieval systems evolve to deal with multimedia content, we see the dimensions of content feature space increasing, and relevance feedback being employed to provide more accurate query results. In this paper, we propose using Tree-structured Vector Quantization (TSVQ) to index high-dimensional data for supporting efficient similarity searches and effective relevance feedback. To support efficient similarity searches, we first use TSVQ to cluster data and store each cluster in a sequential file. We then model a similarity search as a classification problem ― similar objects are much more likely to be found in the clusters into which the query object is classified. When relevance feedback is considered, and thereby features are weighted differently, we show that our approach remains very effective. Our empirical study on both a 51K and a one-million-image dataset shows that tackling indexing as a classification problem and solving the problem with TSVQ is efficient, effective, and scalable with respect to both data dimensions and dataset size.
机译:随着信息检索系统发展为处理多媒体内容,我们看到内容特征空间的尺寸不断增加,并且采用了相关反馈来提供更准确的查询结果。在本文中,我们建议使用树结构矢量量化(TSVQ)为高维数据建立索引,以支持有效的相似性搜索和有效的相关性反馈。为了支持有效的相似性搜索,我们首先使用TSVQ对数据进行聚类并将每个聚类存储在顺序文件中。然后,我们将相似性搜索建模为分类问题-在将查询对象分类到的群集中,更可能找到相似的对象。当考虑相关性反馈并由此对特征进行加权时,我们证明了我们的方法仍然非常有效。我们对51K和一百万张图像数据集的经验研究表明,将索引作为分类问题和使用TSVQ解决问题对于数据维度和数据集大小而言都是高效,有效和可扩展的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号