Semantic Understanding for Video Retrieval with Temporal Multimodal Fusion Analysis

机译：具有时间多模式融合分析的视频检索语义理解

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

As the use of video increases in the digital library, offering effective retrieval service for video is a great demand. However, compared with text and images, video lacks of obvious semantics although video has rich multimodal information, such as text transcripts, visual/audio features and temporal structure. Therefore, understanding semantics embedded in video is necessary for video retrieval. In this paper, we propose a comprehensive approach to semantic understanding of video through automatic annotation with temporal multimodal fusion analysis. Various media aspects are investigated, including meaningful words and contextual distribution in the transcript, visual/audio features, and most importantly, the temporal interval relations involved in video. TFIDF retrieval method with score propagation is used to discover the association between a shot and its corresponding transcript. Experiments on the TRECVID 2003 dataset show that our approach achieves high performance.

机译：随着数字图书馆的视频增加，为视频提供有效的检索服务是一个很大的需求。但是，与文本和图像相比，视频缺乏明显的语义，尽管视频具有丰富的多模式信息，例如文本抄本，可视/音频功能和时间结构。因此，嵌入在视频中的理解语义是视频检索所必需的。在本文中，我们提出了一种通过具有时间多模式融合分析的自动注释来实现对视频的语义理解的综合方法。调查各种媒体方面，包括在记录，视觉/音频功能中的有意义的单词和上下文分发，以及最重要的是，视频中涉及的时间间隔关系。具有分数传播的TFIDF检索方法用于发现拍摄和其相应的转录之间的关联。 TRECVID 2003数据集的实验表明，我们的方法实现了高性能。

著录项

来源
《International Conference on Universal Digital Library》|2005年||共6页
会议地点
作者
ZHUANG Yue-ting; YANG Hui; WU Fei; Minstry of Education(MOE) of China; US National Science Foundation(NSF); Indian Institute of Science(IISc);
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类电子图书馆、数字图书馆;
关键词
video retrieval; semantic understanding; multimedia analysis;

机译：视频检索;语义理解;多媒体分析;

相似文献

外文文献
中文文献
专利

1. Multimodal Information Fusion for Semantic Video Analysis [J] . Elvan Gulen, Turgay Yilmaz, Adnan Yazici International journal of multimedia data engineering & management . 2012,第4期

机译：用于语义视频分析的多模式信息融合
2. Joint modality fusion and temporal context exploitation for semantic video analysis [J] . Papadopoulos Georgios, Mezaris Vasileios, Kompatsiaris Ioannis, EURASIP journal on advances in signal processing . 2011,第20aPta3期

机译：联合模式融合和时态上下文开发，用于语义视频分析
3. Joint modality fusion and temporal context exploitation for semantic video analysis [J] . Georgios Th Papadopoulos, Vasileios Mezaris, Ioannis Kompatsiaris, EURASIP journal on advances in signal processing . 2011,第1期

机译：联合模式融合和时态上下文开发用于语义视频分析
4. Semantic Understanding for Video Retrieval with Temporal Multimodal Fusion Analysis [C] . ZHUANG Yue-ting, YANG Hui, WU Fei International Conference on Universal Digital Library(ICUDL2005); 20051031-1102; Hangzhou(CN) . 2005

机译：时间多模态融合分析的视频检索语义理解
5. DiVAS: Digital-video-audio-sketch capture, retrieval, and understanding of unstructured multimodal design knowledge. [D] . Yin, Zhen. 2006

机译：DiVAS：数字视频，音频草图的捕获，检索和对非结构化多模式设计知识的理解。
6. The Ins and Outs of Meaning: Behavioral and Neuroanatomical Dissociation of Semantically-Driven Word Retrieval and Multimodal Semantic Recognition in Aphasia [O] . Daniel Mirman, Yongsheng Zhang, Ze Wang, -1

机译：意义的来龙去脉：失语的语义驱动词检索与行为和神经解剖学分离和多模态语义识别
7. COMBINING MULTIMODAL AND TEMPORAL CONTEXTUAL INFORMATION FOR SEMANTIC VIDEO ANALYSIS [O] . Georgios Th. Papadopoulos, Vasileios Mezaris, Ioannis Kompatsiaris, 2010

机译：结合语音视频分析的多模态和时间语境信息

Semantic Understanding for Video Retrieval with Temporal Multimodal Fusion Analysis

摘要

著录项

相似文献

相关主题

期刊订阅