首页> 外文会议>Conference of the International Speech Communication Association >Indexing Multimedia Documents with Acoustic Concept Recognition Lattices
【24h】

Indexing Multimedia Documents with Acoustic Concept Recognition Lattices

机译:索引多媒体文件与声学概念识别格子

获取原文

摘要

The amount of multimedia data is increasing every day and there is a growing demand for high-accuracy multimedia retrieval systems that go beyond retrieving simple events (e.g., detecting a sport video), to more specific and hard-to-detect events (e.g., a point in a tennis match). To retrieve these complex events, audio content features play an important role since they provide complementary information to image/video features. In this paper, we propose a novel approach where we employ an HMM-based acoustic concept recognition (ACR) system and convert resulting recognition lattices into acoustic concept indexes to represent multimedia audio content. Lattice indexes are created by extracting posterior-weighted Ngram counts from the ACR lattices and they are used as features in SVM-based classification for multimedia event detection (MED) task. We evaluate the proposed approach on the NIST 2011 TRECVID MED development set, which consists of user-generated videos from the internet. Proposed approach yields an Equal Error Rate (EER) of 31.6% on this acoustically challenging dataset (on a set of 5 video events) outperforming previously proposed supervised and unsupervised approaches on the.same dataset (34.5% and 36.9% respectively).
机译:多媒体数据的数量每天都在增加,并且对高精度多媒体检索系统的需求不断增长,该系统超出了检索简单事件(例如,检测运动视频),以更具体的和难以检测的事件(例如,网球比赛中的一个点)。为了检索这些复杂的事件,音频内容特征在于它们为图像/视频特征提供互补信息,因此播放了重要作用。在本文中,我们提出了一种新颖的方法,我们采用了基于赫姆的声学概念识别(ACR)系统并将产生的识别格转换为声学概念索引来表示多媒体音频内容。通过从ACR格子中提取后加权的Ngram计数来创建格子指数,并且它们用作基于SVM的类别的多媒体事件检测(MED)任务的特征。我们评估NIST 2011 TRECVID MED开发集的建议方法,由Internet中包含用户生成的视频。所提出的方法在这一声学挑战性数据集(在一组5个视频事件上)出现了31.6%的相同错误率(eer),优于先前提出的监督和无人监督的方法,以前提出的,并分别为34.5%和36.9%)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号