首页> 外文期刊>Multimedia Tools and Applications >Descriptor optimization for multimedia indexing and retrieval
【24h】

Descriptor optimization for multimedia indexing and retrieval

机译:多媒体索引和检索的描述符优化

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we propose and evaluate a method for optimizing descriptors used for content-based multimedia indexing and retrieval. A large variety of descriptors are commonly used for this purpose. However, the most efficient ones often have characteristics preventing them to be easily used in large scale systems. They may have very high dimensionality (up to tens of thousands dimensions) and/or be suited for a distance which is costly to compute (e.g. x~2)- The proposed method combines a PCA-based dimensionality reduction with pre- and post-PCA non-linear transformations. The resulting transformation is globally optimized. The produced descriptors have a much lower dimensionality while performing at least as well, and often significantly better, with the Euclidean distance than the original high dimensionality descriptors with their optimal distance. Our approach also includes a hyper-parameter optimization procedure based on the use of a fast kNN classifier and on a polynomial fit to overcome the MAP metric instability. The method has been validated and evaluated on a variety of descriptors using the TRECVid 2010 semantic indexing task data. It has been applied at large scale for the TRECVid 2012 semantic indexing task on tens of descriptors of various types and with initial dimensionalities ranging from 15 up to 32,768. The same transformation can be used also for multimedia retrieval in the context of query by example and/or of relevance feedback.
机译:在本文中,我们提出并评估了一种优化用于基于内容的多媒体索引和检索的描述符的方法。为此目的通常使用各种各样的描述符。但是,最高效的系统通常具有某些特性,使其无法在大型系统中轻松使用。它们可能具有非常高的尺寸(多达数万个尺寸)和/或适合于计算成本高的距离(例如x〜2)。所提出的方法将基于PCA的尺寸减少与前后PCA非线性变换。由此产生的转换是全局优化的。所产生的描述符具有极低的维数,同时在欧几里得距离方面的性能至少也要好,并且通常要好于具有最佳距离的原始高维描述符。我们的方法还包括基于使用快速kNN分类器和多项式拟合的超参数优化过程,以克服MAP度量不稳定性。该方法已使用TRECVid 2010语义索引任务数据在各种描述符上得到验证和评估。 TRECVid 2012语义索引任务已将其大规模应用在数十个各种类型的描述符上,并且初始维度在15到32,768之间。在示例查询和/或相关反馈的情况下,相同的变换也可以用于多媒体检索。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号