首页> 外文期刊>EURASIP journal on advances in signal processing >Exploiting Temporal Feature Integration for Generalized Sound Recognition
【24h】

Exploiting Temporal Feature Integration for Generalized Sound Recognition

机译:利用时间特征集成进行广义声音识别

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

This paper presents a methodology that incorporates temporal feature integration for automated generalized sound recognition. Such a system can be of great use to scene analysis and understanding based on the acoustic modality. The performance of three feature sets based on Mel filterbank, MPEG-7 audio protocol, and wavelet decomposition is assessed. Furthermore we explore the application of temporal integration using the following three different strategies: (a) short-term statistics, (b) spectral moments, and (c) autoregressive models. The experimental setup is thoroughly explained and based on the concurrent usage of professional sound effects collections. In this way we try to form a representative picture of the characteristics of ten sound classes. During the first phase of our implementation, the process of audio classification is achieved through statistical models (HMMs) while a fusion scheme that exploits the models constructed by various feature sets provided the highest average recognition rate. The proposed system not only uses diverse groups of sound parameters but also employs the advantages of temporal feature integration.
机译:本文提出了一种方法,该方法结合了时间特征集成以实现自动化的广义声音识别。这样的系统对于基于声学模态的场景分析和理解很有用。评估了基于Mel滤波器组,MPEG-7音频协议和小波分解的三个功能集的性能。此外,我们使用以下三种不同的策略探索时间积分的应用:(a)短期统计,(b)谱矩和(c)自回归模型。实验设置已得到全面解释,并基于专业音效合集的同时使用。通过这种方式,我们尝试形成十种声音类别的特征的代表性图片。在我们实施的第一阶段,音频分类的过程是通过统计模型(HMM)实现的,而利用各种功能集构建的模型的融合方案则提供了最高的平均识别率。所提出的系统不仅使用各种声音参数​​组,而且还利用了时间特征集成的优点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号