首页> 外文期刊>Pattern recognition letters >Real-world acoustic event detection
【24h】

Real-world acoustic event detection

机译:真实的声音事件检测

获取原文
获取原文并翻译 | 示例
           

摘要

Acoustic Event Detection (AED) aims to identify both timestamps and types of events in an audio stream. This becomes very challenging when going beyond restricted highlight events and well controlled recordings. We propose extracting discriminative features for AED using a boosting approach, which outperform classical speech perceptual features, such as Mel-frequency Cepstral Coefficients and log frequency filterbank parameters. We propose leveraging statistical models better fitting the task. First, a tandem connectionist-HMM approach combines the sequence modeling capabilities of the HMM with the high-accuracy context-dependent discriminative capabilities of an artificial neural network trained using the minimum cross entropy criterion. Second, an SVM-GMM-supervector approach uses noise-adaptive kernels better approximating the KL divergence between feature distributions in different audio segments. Experiments on the CLEAR 2007 AED Evaluation set-up demonstrate that the presented features and models lead to over 45% relative performance improvement, and also outperform the best system in the CLEAR AED Evaluation, on detection of twelve general acoustic events in a real seminar environment.
机译:声音事件检测(AED)旨在识别音频流中的时间戳和事件类型。当超出受限制的精彩场面事件和受到良好控制的录音时,这变得非常具有挑战性。我们建议使用增强方法来提取AED的判别特征,该特征优于经典语音感知特征,例如Mel频率倒谱系数和对数频率滤波器组参数。我们建议利用统计模型更好地适合任务。首先,串联连接HMM方法将HMM的序列建模功能与使用最小交叉熵准则训练的人工神经网络的高精度上下文相关判别能力相结合。其次,SVM-GMM-超向量方法使用了自适应噪声的内核,可以更好地近似不同音频片段中特征分布之间的KL散度。在CLEAR 2007 AED评估设置上进行的实验表明,在实际的研讨会环境中检测到十二种一般声学事件时,所提供的功能和模型可以使相对性能提高45%以上,并且优于CLEAR AED评估中的最佳系统。 。

著录项

  • 来源
    《Pattern recognition letters》 |2010年第12期|P.1543-1551|共9页
  • 作者单位

    Beckman Institute of Advanced Science and Technology, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA;

    rnBeckman Institute of Advanced Science and Technology, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA;

    rnBeckman Institute of Advanced Science and Technology, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA;

    rnBeckman Institute of Advanced Science and Technology, Department of Electrical and Computer Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    acoustic event detection; feature selection; hidden markov model; artificial neural network; tandem model; gaussian mixture model supervector;

    机译:声音事件检测;特征选择;隐藏的马尔可夫模型;人工神经网络;串联模型高斯混合模型超向量;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号