首页> 外文期刊>Pattern recognition letters >Overlapping sound event recognition using local spectrogram features and the generalised hough transform
【24h】

Overlapping sound event recognition using local spectrogram features and the generalised hough transform

机译:使用局部频谱图特征和广义霍夫变换进行重叠的声音事件识别

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we address the challenging task of simultaneous recognition of overlapping sound events from single channel audio. Conventional frame-based methods are not well suited to the problem, as each time frame contains a mixture of information from multiple sources. Missing feature masks are able to improve the recognition in such cases, but are limited by the accuracy of the mask, which is a non-trivial problem. In this paper, we propose an approach based on Local Spectrogram Features (LSFs) which represent local spectral information that is extracted from the two-dimensional region surrounding "keypoints" detected in the spectrogram. The keypoints are designed to locate the sparse, discriminative peaks in the spectrogram, such that we can model sound events through a set of representative LSF clusters and their occurrences in the spectrogram. To recognise overlapping sound events, we use a Generalised Hough Transform (GHT) voting system, which sums the information over many independent keypoints to produce onset hypotheses, that can detect any arbitrary combination of sound events in the spectrogram. Each hypothesis is then scored against the class distribution models to recognise the existence of the sound in the spectrogram. Experiments on a set of five overlapping sound events, in the presence of non-stationary background noise, demonstrate the potential of our approach.
机译:在本文中,我们解决了从单通道音频中同时识别重叠声音事件的艰巨任务。传统的基于帧的方法不太适合该问题,因为每个时间帧都包含来自多个源的信息的混合。在这种情况下,缺少特征蒙版可以提高识别度,但受到蒙版精度的限制,这是一个不小的问题。在本文中,我们提出了一种基于局部频谱图特征(LSF)的方法,该特征表示从频谱图中检测到的“关键点”周围的二维区域提取的局部频谱信息。设计关键点是为了定位频谱图中的稀疏,有区别的峰值,以便我们可以通过一组代表性的LSF群集及其在频谱图中的出现来对声音事件进行建模。为了识别重叠的声音事件,我们使用了通用霍夫变换(GHT)投票系统,该系统将许多独立关键点上的信息相加,以产生起始假设,该假设可以检测频谱图中声音事件的任意组合。然后,根据类别分布模型对每个假设进行评分,以识别声谱图中声音的存在。在存在非平稳背景噪声的情况下,对一组五个重叠的声音事件进行的实验证明了我们方法的潜力。

著录项

  • 来源
    《Pattern recognition letters》 |2013年第9期|1085-1093|共9页
  • 作者

    J. Dennis; H.D. Tran; E.S. Chng;

  • 作者单位

    Institute for Infocomm Research, 1 Fusionopolis Way, #08-01 South Tower Connexis, Singapore 138632, Singapore,School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore;

    Institute for Infocomm Research, 1 Fusionopolis Way, #08-01 South Tower Connexis, Singapore 138632, Singapore;

    School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    overlapping sound event recognition; local spectrogram features; keypoint detectionvgeneralised hough transform;

    机译:重叠的声音事件识别;局部频谱图特征;关键点检测;广义霍夫变换;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号