Conference: IEEE International Conference on Acoustics, Speech and Signal Processing

Exploring audio semantic concepts for event-based video retrieval

Abstract

Audio semantic concepts (sound events) play an important role in audio-based content analysis. Capturing semantic information effectively from the complex occurrence patterns of sound events in YouTube-quality videos is a challenging problem. This paper presents a novel framework for semantic information extraction that handles the complex conditions of real-world videos, and evaluates it on the NIST multimedia event detection (MED) task. We compute the occurrence confidence matrix of sound events and explore multiple strategies for generating clip-level semantic features from this matrix. We evaluate performance on the TRECVID 2011 MED dataset. The proposed method outperforms a previous HMM-based system. A late fusion experiment with low-level features and a text feature (ASR) shows that audio semantic concepts capture complementary information in the soundtrack.
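The abstract does not say which strategies the authors use to turn the occurrence confidence matrix into clip-level features, nor how the late fusion is weighted. The sketch below is therefore only an illustration under stated assumptions: the matrix is taken to have one row per audio segment and one column per sound-event concept, the three pooling strategies (`max`, `mean`, `active_fraction`) and the 0.5 threshold are hypothetical choices, and `late_fusion` is a plain weighted average of per-system scores, not the paper's method.

```python
from typing import Dict, List

# Assumed layout: rows = audio segments of one clip, columns = sound-event concepts.
Matrix = List[List[float]]


def pool_clip_features(conf: Matrix, threshold: float = 0.5) -> Dict[str, List[float]]:
    """Collapse a (segments x concepts) confidence matrix into clip-level vectors.

    The pooling strategies here are illustrative assumptions, not the paper's.
    """
    if not conf or not conf[0]:
        raise ValueError("confidence matrix must be non-empty")
    n_segments = len(conf)
    columns = list(zip(*conf))  # one tuple of segment scores per concept
    return {
        # Strongest evidence for each concept anywhere in the clip.
        "max": [max(col) for col in columns],
        # Average confidence for each concept over the clip.
        "mean": [sum(col) / n_segments for col in columns],
        # Fraction of segments where the concept reaches the threshold.
        "active_fraction": [
            sum(1 for v in col if v >= threshold) / n_segments for col in columns
        ],
    }


def late_fusion(scores: Dict[str, float], weights: Dict[str, float]) -> float:
    """Weighted average of per-system detection scores for one clip (assumed scheme)."""
    total = sum(weights[name] for name in scores)
    if total == 0:
        raise ValueError("weights must not sum to zero")
    return sum(weights[name] * scores[name] for name in scores) / total


# Toy clip: 3 segments, 2 hypothetical concepts (values are made up).
example = [[0.9, 0.1], [0.2, 0.3], [0.7, 0.6]]
features = pool_clip_features(example)
print(features["max"])

# Hypothetical per-system scores for one clip and event, fused with equal weights.
fused = late_fusion(
    {"semantic": 0.8, "low_level": 0.6, "asr": 0.4},
    {"semantic": 1.0, "low_level": 1.0, "asr": 1.0},
)
print(fused)
```

Any of the pooled vectors could serve as the clip-level semantic feature fed to an event classifier; which pooling works best is an empirical question that the abstract leaves to the paper's experiments.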