首页> 外文期刊>Journal of Theoretical and Applied Information Technology >UNIFIED CONCEPT-BASED MULTIMEDIA INFORMATION RETRIEVAL SYSTEM USING DEEP LEARNING PROCESS WITH ONTOLOGY
【24h】

UNIFIED CONCEPT-BASED MULTIMEDIA INFORMATION RETRIEVAL SYSTEM USING DEEP LEARNING PROCESS WITH ONTOLOGY

机译:基于本体的深度学习过程的基于概念的统一多媒体信息检索系统

获取原文
       

摘要

The amount of digital data is growing at a staggering pace and mostly in the form of text. The growth of data, including multimedia (text, images, video, and audio) poses challenges in developing Multimedia Information Retrieval Systems (MIRS). Today, MIRS uses one or two media as a query input, for example, Google using text and image. There are comprehensive information needs in multimedia, including video and audio media as query input that can increase the amount and variety of information in retrieval result. Also, it is difficult to organize the relationship between the query input and the retrieval result in the same context or semantically related. This paper proposes a Unified Concept-based MIRS using deep learning with Ontology to tackle these problems. There are three main processes of this research; the first is Indexing Process which consist of collecting multimedia data, creating the multimedia dataset, extracting multimedia features, identifying and classifying objects and media format with Convolutional Neural Network (CNN) and Recurrent Neural Network (RNN), then storing object indexed as the concepts. The second is Query Processing which consists of inputting multimedia query or object to identifying and classifying object becomes a concept, and then the third is Retrieval and Rank Process of concept or object indexed and concept of query input. The ontology organizes the relationship between the concepts. Resource of this research uses the Cultural Heritage domain. The Unified Concept-based MIRS shows the capability of the system to extract features of four media (text, image, audio, and video) as concept representation to identify the multimedia object from query input and retrieve the object in four types of media at once. The retrieval results performance with deep learning increase about 30%-40% than support vector machine technique, and the usage of ontology increases the retrieval relevant results about 30 % is compared with MIRS without Ontology.
机译:数字数据量以惊人的速度增长,并且大多数以文本形式增长。包括多媒体(文本,图像,视频和音频)在内的数据的增长给开发多媒体信息检索系统(MIRS)带来了挑战。如今,MIRS使用一种或两种媒体作为查询输入,例如Google使用文本和图像。多媒体中存在着广泛的信息需求,包括视频和音频媒体作为查询输入,可以增加检索结果中信息的数量和种类。而且,很难在相同的上下文或语义相关的情况下组织查询输入和检索结果之间的关系。本文提出了一种基于深度学习与本体论的基于统一概念的MIRS,以解决这些问题。这项研究有三个主要过程:第一个是索引过程,该过程包括收集多媒体数据,创建多媒体数据集,提取多媒体特征,使用卷积神经网络(CNN)和递归神经网络(RNN)对对象和媒体格式进行识别和分类,然后将被索引的对象存储为概念。第二个是查询处理,它由输入多媒体查询或对象以识别和分类对象成为一个概念组成,然后第三个是概念或对象索引和查询输入的概念的检索和排序过程。本体组织了概念之间的关系。本研究的资源使用文化遗产领域。基于统一概念的MIRS显示了系统提取四种媒体(文本,图像,音频和视频)的特征作为概念表示的能力,以从查询输入中识别多媒体对象并一次检索四种类型的媒体中的对象。深度学习的检索结果性能比支持向量机技术提高了约30%-40%,与没有本体的MIRS相比,本体的使用使检索相关结果提高了约30%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号