...
首页> 外文期刊>Expert Systems with Application >Multimodal Retrieval using Mutual Information based Textual Query Reformulation
【24h】

Multimodal Retrieval using Mutual Information based Textual Query Reformulation

机译:使用基于互信息的文本查询重构的多模式检索

获取原文
获取原文并翻译 | 示例
           

摘要

Multimodal Retrieval is a well-established approach for image retrieval. Usually, images are accompanied by text caption along with associated documents describing the image. Textual query expansion as a form of enhancing image retrieval is a relatively less explored area. In this paper, we first study the effect of expanding textual query on both image and its associated text retrieval. Our study reveals that judicious expansion of textual query through keyphrase extraction can lead to better results, either in terms of text retrieval or both image and text-retrieval. To establish this, we use two well-known keyphrase extraction techniques based on tf-idf and KEA. While query expansion results in increased retrieval efficiency, it is imperative that the expansion be semantically justified. So, we propose a graph-based keyphrase extraction model that captures the relatedness between words in terms of both mutual information and relevance feedback. Most of the existing works have stressed on bridging the semantic gap by using textual and visual features, either in combination or individually. The way these text and image features are combined determines the efficacy of any retrieval. For this purpose, we adopt Fisher-LDA to adjudge the appropriate weights for each modality. This provides us with an intelligent decision-making process favoring the feature set to be infused into the final query. Our proposed algorithm is shown to supersede the previously mentioned keyphrase extraction algorithms for query expansion significantly. A rigorous set of experiments performed on ImageCLEF-2011 Wikipedia Retrieval task dataset validates our claim that capturing the semantic relation between words through Mutual Information followed by expansion of a textual query using relevance feedback can simultaneously enhance both text and image retrieval. (C) 2016 Elsevier Ltd. All rights reserved.
机译:多峰检索是一种完善的图像检索方法。通常,图像带有文本标题以及描述该图像的相关文档。文本查询扩展作为增强图像检索的一种形式,是一个相对较少探索的领域。在本文中,我们首先研究了扩展文本查询对图像及其相关文本检索的影响。我们的研究表明,通过关键词提取明智地扩展文本查询可以带来更好的结果,无论是在文本检索方面还是在图像检索和文本检索方面。为此,我们使用了两种基于tf-idf和KEA的著名关键词提取技术。虽然查询扩展可提高检索效率,但必须在语义上证明扩展是合理的。因此,我们提出了一种基于图的关键词提取模型,该模型可以根据互信息和相关性反馈来捕获单词之间的相关性。现有的大多数作品都强调通过组合或单独使用文本和视觉功能来弥合语义鸿沟。这些文本和图像功能的组合方式决定了任何检索的功效。为此,我们采用Fisher-LDA来为每种方式确定适当的权重。这为我们提供了一个智能的决策过程,该过程有利于将特征集注入到最终查询中。我们提出的算法显示出可以显着地取代前面提到的关键字提取算法,以实现查询扩展。在ImageCLEF-2011 Wikipedia Retrieval任务数据集上进行的一组严格的实验验证了我们的主张,即通过互信息捕获单词之间的语义关系,然后使用相关性反馈扩展文本查询可以同时增强文本和图像检索。 (C)2016 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号