首页> 外文期刊>Multimedia Systems >Toward semantic indexing and retrieval using hierarchical audio models
【24h】

Toward semantic indexing and retrieval using hierarchical audio models

机译:使用分层音频模型实现语义索引和检索

获取原文
获取原文并翻译 | 示例
       

摘要

Semantic-level content analysis is a crucial issue in achieving efficient content retrieval and management. We propose a hierarchical approach that models the statistical characteristics of audio events over a time series to accomplish semantic context detection. Two stages, audio event and semantic context modeling, are devised to bridge the semantic gap between physical audio features and semantic concepts. In this work, hidden Markov models (HMMs) are used to model four representative audio events, i.e., gunshot, explosion, engine, and car-braking, in action movies. At the semantic-context level, Gaussian mixture models (GMMs) and ergodic HMMs are investigated to fuse the characteristics and correlations between various audio events. They provide cues for detecting gunplay and car-chasing scenes, two semantic contexts we focus on in this work. The promising experimental results demonstrate the effectiveness of the proposed approach and exhibit that the proposed framework provides a foundation in semantic indexing and retrieval. Moreover, the two fusion schemes are compared, and the relations between audio event and semantic context are studied.
机译:语义级别的内容分析是实现有效的内容检索和管理的关键问题。我们提出了一种分层方法,该模型可以对时间序列中音频事件的统计特征建模,以完成语义上下文检测。设计了两个阶段,音频事件和语义上下文建模,以弥合物理音频特征和语义概念之间的语义鸿沟。在这项工作中,隐藏的马尔可夫模型(HMM)用于对动作电影中的四个代表性音频事件进行建模,即枪声,爆炸,引擎和汽车制动。在语义上下文级别,研究了高斯混合模型(GMM)和遍历HMM,以融合各种音频事件之间的特性和相关性。它们为检测枪战和购车场景提供了线索,这是我们在本工作中重点关注的两个语义上下文。有希望的实验结果证明了该方法的有效性,并表明该框架为语义索引和检索提供了基础。此外,比较了两种融合方案,研究了音频事件与语义上下文之间的关系。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号