首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >EXTRACTING SPOKEN AND ACOUSTIC CONCEPTS FOR MULTIMEDIA EVENT DETECTION
【24h】

EXTRACTING SPOKEN AND ACOUSTIC CONCEPTS FOR MULTIMEDIA EVENT DETECTION

机译:提取多媒体事件检测的语音和声学概念

获取原文

摘要

Because of the popularity of online videos, there has been much interest in recent years in audio processing for the improvement of online video search. In this paper, we explore using acoustic concepts and spoken concepts extracted via audio segmentation/recognition and speech recognition respectively for Multimedia Event Detection (MED). To extract spoken concepts, a segmenter trained on annotated data from user videos segments the audio into three classes: speech, music, and other sounds. The speech segments are passed to an Automatic Speech Recognition (ASR) engine, and words from the 1-best ASR output, as well as posterior-weighted word counts collected from ASR lattices, are used as features to an SVM based classifier. Acoustic concepts are extracted using the 3-gram lattice counts of two Acoustic Concept Recognition (ACR) systems trained on 7 broad classes. MED results are reported on a subset of the NIST 2011 TRECVID data. We find that spoken concepts using lattices yield a 15% relative improvement in Average Pmiss (APM) over 1-best based features. Further, the proposed spoken concepts gave a 30% relative gain in APM over the ACR-based MED system using 7 classes. Lastly, we obtain an 8% relative APM improvement after score-level fusion of both concept types, showing the effective coupling of both approaches.
机译:由于在线视频的普及,近年来在改善在线视频搜索的音频处理中有很多兴趣。在本文中,我们使用分别通过音频分割/识别和语音识别提取的声学概念和口头概念来探索多媒体事件检测(MED)。要提取口语概念,将从用户视频将音频分段为三类:语音,音乐和其他声音培训的分段器培训。语音段传递给自动语音识别(ASR)引擎,并且来自1-Best ASR输出的单词,以及从ASR格子收集的后加权字计数,用作基于SVM的分类器的特征。使用在7个广泛类别培训的两个声学概念识别(ACR)系统的3克晶格计数提取声学概念。 MED结果在NIST 2011 TRECVID数据的子集上报告。我们发现使用格子的口语概念在基于1-ext的特征上的平均PMISS(APM)的相对改善产生了15%的相对改善。此外,所提出的口语概念在基于ACR的MED系统上使用7类提供了30%的相对增益。最后,我们在两种概念类型的分数融合后获得了8%的相对APM改进,显示了两种方法的有效耦合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号