首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Exploring audio semantic concepts for event-based video retrieval

【24h】

Exploring audio semantic concepts for event-based video retrieval

机译：探索用于基于事件的视频检索的音频语义概念

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The audio semantic concepts (sound events) play important roles in audio-based content analysis. How to capture the semantic information effectively from the complex occurrence pattern of sound events in YouTube quality videos is a challenging problem. This paper presents a novel framework to handle the complex situation for semantic information extraction in real-world videos and evaluate through the NIST multimedia event detection task (MED). We calculate the occurrence confidence matrix of sound events and explore multiple strategies to generate clip-level semantic features from the matrix. We evaluate the performance using TRECVID2011 MED dataset. The proposed method outperforms previous HMM-based system. The late fusion experiment with the low-level features and text feature (ASR) shows that audio semantic concepts capture complementary information in the soundtrack.

机译：音频语义概念（声音事件）在基于音频的内容分析中起着重要的作用。如何有效地从YouTube优质视频中声音事件的复杂发生模式中捕获语义信息是一个具有挑战性的问题。本文提出了一个新颖的框架来处理现实世界视频中语义信息提取的复杂情况，并通过NIST多媒体事件检测任务（MED）进行评估。我们计算声音事件的发生置信度矩阵，并探索多种策略从该矩阵生成剪辑级语义特征。我们使用TRECVID2011 MED数据集评估性能。所提出的方法优于以前的基于HMM的系统。使用低级功能和文本功能（ASR）进行的后期融合实验表明，音频语义概念捕获了配乐中的补充信息。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing 》|2014年|1360-1364|共5页
会议地点
作者
Wang Yipei; Rawat Shourabh; Metze Florian;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
audio processing; multimedia retrieval; semantic concept;

机译：音频处理;多媒体检索;语义概念;

相似文献

外文文献
中文文献
专利

1. Semantic Analysis of Field Sports Video using a Petri-Net of Audio- Visual Concepts [J] . Liang Bai, Songyang Lao, Alan F. Smeaton, The Computer journal . 2009 ,第7期

机译：利用视听概念的Petri网对田径运动视频进行语义分析
2. CLOVIS: towards precision-oriented text-based video retrieval through the unification of automatically-extracted concepts and relations of the visual and audio/speech contents [J] . M. Belkhatir Journal of Intelligent Information Systems . 2010 ,第2期

机译：CLOVIS：通过统一自动提取的概念以及视音频和语音内容之间的关系，实现基于精度的基于文本的视频检索
3. Leveraging visual concepts and query performance prediction for semantic-theme-based video retrieval [J] . Stevan Rudinac, Martha Larson, Alan Hanjalic International Journal of Multimedia Information Retrieval . 2013 ,第4期

机译：利用视觉概念和查询性能预测进行基于语义主题的视频检索
4. EXPLORING AUDIO SEMANTIC CONCEPTS FOR EVENT-BASED VIDEO RETRIEVAL [C] . Yipei Wang, Shourabh Rawat, Florian Metze IEEE International Conference on Acoustics, Speech and Signal Processing . 2014

机译：探索基于事件的视频检索的音频语义概念
5. A probablistic framework for mapping audio-visual features to high-level semantics in terms of concepts and context. [D] . Naphade, Milind Ramesh. 2001

机译：根据概念和上下文将视听功能映射到高级语义的概率框架。
6. Methods for Exploring the Semantics of the Relationships between Co-occurring UMLS Concepts [O] . Anita Burgun, Olivier Bodenreider -1

机译：共同出现的UMLS概念之间关系的语义探索方法
7. Exploring audio semantic concepts for event-based video retrieval [O] . Wang Yipei, Rawat Shourabh, Metze Florian 2014

机译：探索用于基于事件的视频检索的音频语义概念

Exploring audio semantic concepts for event-based video retrieval

摘要

著录项

相似文献

相关主题

期刊订阅