Journal: Information Retrieval

Multimodal indexing based on semantic cohesion for image retrieval

Abstract

This paper introduces two novel strategies for representing multimodal images, with application to multimedia image retrieval. We consider images that come with both text and labels: text describes the image content at a very high semantic level (e.g., making reference to places, dates or events), while labels provide a mid-level description of the image (i.e., in terms of the objects that can be seen in it). Accordingly, the main assumption of this work is that by combining information from text and labels we can develop very effective retrieval methods. We study standard information fusion techniques for combining both sources of information. However, although the performance of such techniques is highly competitive, they cannot effectively capture the content of images. We therefore propose two novel representations for multimodal images that attempt to exploit the semantic cohesion among terms from different modalities. These representations build on distributional term representations widely used in computational linguistics: the content of an image is modeled by a distribution of co-occurrences over terms, or of occurrences over other images, so that the representation can be regarded as an expansion of the multimodal terms in the image. We report experimental results on the SAIAPR TC-12 benchmark, using two sets of topics from ImageCLEF competitions, with both manually and automatically generated labels. The results show that the proposed representations significantly outperform both standard multimodal techniques and unimodal methods. Results with manually assigned labels provide an upper bound on the attainable retrieval performance, while results with automatically generated labels are encouraging. The novel representations capture the content of multimodal images more effectively. We emphasize that although we have applied our representations to multimedia image retrieval, the same formulation can be adopted for modeling other multimodal documents (e.g., videos).
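To make the first kind of representation concrete, the following is a minimal sketch (not the authors' implementation) of a distributional term representation over a multimodal vocabulary: each term, whether a caption word or a visual label, is described by its co-occurrence profile across the collection, and a multimodal image is represented by the normalized sum of the profiles of its terms, i.e., an expansion of the image's terms. All identifiers, the `label:` prefix, and the toy data are illustrative assumptions; the second representation described above, a distribution of occurrences over other images, would follow the same pattern with per-term document counts in place of per-term context counts.

```python
from collections import Counter
from typing import Dict, List

def tcor_vectors(docs: List[List[str]]) -> Dict[str, Counter]:
    """For each term, count how often every other term appears in the
    same multimodal document (caption words and visual labels alike)."""
    cooc: Dict[str, Counter] = {}
    for terms in docs:
        uniq = set(terms)
        for t in uniq:
            ctx = cooc.setdefault(t, Counter())
            for u in uniq:
                if u != t:
                    ctx[u] += 1
    return cooc

def expand(doc_terms: List[str], cooc: Dict[str, Counter]) -> Dict[str, float]:
    """Represent a document as a normalized distribution over the vocabulary,
    obtained by summing the co-occurrence profiles of its own terms."""
    acc: Counter = Counter()
    for t in set(doc_terms):
        acc.update(cooc.get(t, Counter()))
    total = float(sum(acc.values())) or 1.0
    return {term: n / total for term, n in acc.items()}

# Toy collection: each "image" mixes caption words with region labels.
collection = [
    ["beach", "sunset", "label:sky", "label:sea"],
    ["mountain", "hike", "label:sky", "label:rock"],
    ["harbour", "boats", "label:sea", "label:boat"],
]
cooc = tcor_vectors(collection)
print(expand(["sunset", "label:sea"], cooc))
```

Two such expanded distributions can then be compared at retrieval time with any standard similarity measure (e.g., cosine similarity) between the query and each indexed image.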