Semantic image retrieval for complex queries using a knowledge parser

Chen Hua; Trouve Antoine; Murakami Kazuaki J.; Fukuda Akira

首页> 外文期刊>Multimedia Tools and Applications >Semantic image retrieval for complex queries using a knowledge parser

【24h】

Semantic image retrieval for complex queries using a knowledge parser

机译：使用知识解析器对复杂查询进行语义图像检索

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In order to improve the retrieval accuracy of image retrieval systems, research focus has been shifted from designing sophisticated low-level feature extraction algorithms to combining image retrieval processing with rich semantics and knowledge-based methods. In this paper, we aim at improving text-based image retrieval for complex natural language queries by using a semantic parser (Knowledge Parser or K-Parser). From text written in natural language, the K-parser extracts a graphical semantic representation of the objects involved, their properties as well as their relations. We analyze both the image textual captions and the natural language queries with the K-parser. As a technical solution, we leverage RDF in two ways: first, we store the parsed image captions as RDF triples; second, we translate image queries into SPARQL queries. When applied to the Flickr8k dataset with a set of 16 custom queries, we notice that the K-parser exhibits some biases that negatively affect the accuracy of the queries. We propose two techniques to address the weaknesses: (1) we introduce a set of rules to transform the output of K-parser and fix some basic, recurrent parsing mistakes that occur on the captions of Flickr8k; (2) we leverage two popular commonsense knowledge databases, ConceptNet and WordNet, to raise the accuracy of queries on broad concepts. Using those two techniques, we can fix most of the initial retrieval errors, and accurately execute our set of 16 queries on the Flickr8k dataset.

机译：为了提高图像检索系统的检索精度，研究重点已从设计复杂的低级特征提取算法转变为将图像检索处理与丰富的语义和基于知识的方法相结合。在本文中，我们旨在通过使用语义解析器（知识解析器或K-Parser）改善复杂自然语言查询的基于文本的图像检索。 K分析器从用自然语言编写的文本中提取所涉及对象，其属性以及它们之间的关系的图形语义表示。我们使用K解析器分析图像的文字字幕和自然语言查询。作为一种技术解决方案，我们通过两种方式利用RDF：首先，将解析的图像标题存储为RDF的三倍。其次，我们将图像查询转换为SPARQL查询。当将其应用于具有16个自定义查询集的Flickr8k数据集时，我们注意到K分析器表现出一些偏差，这些偏差会对查询的准确性产生负面影响。我们提出了两种技术来解决这些缺点：（1）我们引入了一组规则来转换K分析器的输出，并修复Flickr8k字幕上发生的一些基本的，经常性的分析错误；（2）我们利用两个流行的常识知识数据库ConceptNet和WordNet来提高对广泛概念的查询的准确性。使用这两种技术，我们可以修复大多数初始检索错误，并在Flickr8k数据集上准确执行我们的16个查询集。

著录项

来源
《Multimedia Tools and Applications》 |2018年第9期|10733-10751|共19页
作者
Chen Hua; Trouve Antoine; Murakami Kazuaki J.; Fukuda Akira;
展开▼
作者单位

Kyushu Univ, Grad Sch Informat Sci & Elect Engn, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka 8190395, Japan;

Inst Syst Informat Technol & Nanotechnol ISIT, Sawara Ku, 2-1-22-7F Momochihama, Fukuoka, Fukuoka 8140001, Japan;

Kyushu Univ, Grad Sch Informat Sci & Elect Engn, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka 8190395, Japan;

Kyushu Univ, Grad Sch Informat Sci & Elect Engn, Nishi Ku, 744 Motooka, Fukuoka, Fukuoka 8190395, Japan;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Image retrieval; Object retrieval; Commonsense knowledge; K-parser; RDF;

机译：图像检索;对象检索;常识;K解析器;RDF;

相似文献

外文文献
中文文献
专利

1. Enhancing image retrieval for complex queries using external knowledge sources [J] . Haitham Samih, Sherine Rady, Tarek F. Gharib Multimedia Tools and Applications . 2020,第37a38期

机译：使用外部知识源增强复杂查询的图像检索
2. Image Retrieval for Complex Queries Using Knowledge Embedding [J] . CHANDRAMANI CHAUDHARY, POONAM GOYAL, NAVNEET GOYAL, ACM transactions on multimedia computing communications and applications . 2020,第1期

机译：使用知识嵌入的复杂查询的图像检索
3. Query Rewriting and Semantic Annotation in Semantic-Based Image Retrieval under Heterogeneous Ontologies of Big Data [J] . Jia Baoxian, Meng Bin, Zhang Wunong, Traitement du Signal . 2020,第1期

机译：在大数据的异构本体下基于语义的图像检索中的查询重写与语义注释
4. A Semantic Query Interpreter framework by using knowledge bases for image search and retrieval [C] . 10th IEEE International Symposium on Signal Processing and Information Technology . 2010

机译：使用知识库进行图像搜索和检索的语义查询解释器框架
5. Image retrieval based on complex descriptive queries. [D] . Siddiquie, Behjat. 2011

机译：基于复杂描述性查询的图像检索。
6. Towards knowledge-based retrieval of medical images. The role of semantic indexing image content representation and knowledge-based retrieval. [O] . H. J. Lowe, I. Antipov, W. Hersh, 1998

机译：致力于基于知识的医学图像检索。语义索引图像内容表示和基于知识的检索的作用。
7. Semantic Parsing via Staged Query Graph Generation: Question Answering with Knowledge Base [O] . Wen-tau Yih, Ming-Wei Chang, Xiaodong He, 2015

机译：通过阶段查询图的语义解析：与知识库的问题回答
8. Learning Effective and Robust Knowledge for Semantic Query Optimization [R] . Hsu, C. N. 1996

机译：学习语义查询优化的有效而强大的知识

Semantic image retrieval for complex queries using a knowledge parser

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅