首页> 外文会议>European Signal Processing Conference >Acoustic Event Classification Using Multi-Resolution HMM
【24h】

Acoustic Event Classification Using Multi-Resolution HMM

机译:使用多分辨率HMM的声音事件分类

获取原文

摘要

Real-world acoustic events span a wide range of time and frequency resolutions, from short clicks to longer tonals. This is a challenge for the hidden Markov model (HMM), which uses a fixed segmentation and feature extraction, forcing a compromise between time and frequency resolution. The multiresolution HMM (MR-HMM) is an extension of the HMM that assumes not only an underlying (hidden) random state sequence, but also an underlying random segmentation, with segments spanning a wide range of sizes and processed using a variety of feature extraction methods. It is shown that the MR-HMM alone, as an acoustic event classifier, has performance comparable to state of the art discriminative classifiers on three open data sets. However, as a generative classifier, the MR-HMM models the underlying data generation process and can generate synthetic data, allowing weaknesses of individual class models to be discovered and corrected. To demonstrate this point, the MR-HMM is combined with auxiliary features that capture temporal information, resulting in significantly improved performance.
机译:真实的声音事件涵盖了很宽的时间和频率分辨率范围,从短促的点击声到更长的音调。对于隐马尔可夫模型(HMM)来说,这是一个挑战,该模型使用固定的分段和特征提取,从而在时间和频率分辨率之间做出折衷。多分辨率HMM(MR-HMM)是HMM的扩展,它不仅假定基础(隐藏)随机状态序列,还假定基础随机分段,其中分段的大小范围很广,并使用各种特征提取进行处理方法。结果表明,单独的MR-HMM作为声音事件分类器,其性能可与三个开放数据集上的最新判别器相媲美。但是,作为生成分类器,MR-HMM对基础数据生成过程进行建模,并且可以生成综合数据,从而可以发现和纠正各个类模型的弱点。为了证明这一点,MR-HMM与捕获时间信息的辅助功能相结合,从而显着提高了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号