首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Bird-phrase segmentation and verification: A noise-robust template-based approach
【24h】

Bird-phrase segmentation and verification: A noise-robust template-based approach

机译:鸟语短语分割和验证:一种基于抗噪模板的方法

获取原文

摘要

In this paper, we present a birdsong-phrase segmentation and verification algorithm that is robust to limited training data, class variability, and noise. The algorithm comprises a noise-robust, Dynamic-Time-Warping (DTW)-based segmentation and a discriminative classifier for outlier rejection. The algorithm utilizes DTW and prominent (high energy) time-frequency regions of training spectrograms to derive a reliable noise-robust template for each phrase class. The resulting template is then used for segmenting continuous recordings to obtain segment candidates whose spectrogram amplitudes in the prominent regions are used as features to a Support Vector Machine (SVM). The algorithm is evaluated on the Cassin's Vireo recordings; our proposed system yields low Equal Error Rates (EER) and segment boundaries that are close to those obtained from manual annotations and, is better than energy or entropy-based birdsong segmentation algorithms. In the presence of additive noise (−10 to 10 dB SNR), the proposed phrase detection system does not degrade as significantly as the other algorithms do.
机译:在本文中,我们提出了一种鸟语短语分割和验证算法,该算法对有限的训练数据,类变异性和噪声具有鲁棒性。该算法包括基于噪声稳健,基于动态时间扭曲(DTW)的分段和用于异常值剔除的判别式分类器。该算法利用DTW和训练频谱图的显着(高能量)时频区域为每个短语类别推导可靠的抗噪模板。然后,将所得的模板用于对连续记录进行分段,以获得分段候选者,其突出区域中的频谱图幅度用作支持向量机(SVM)的特征。该算法在Cassin的Vireo录音中进行评估;我们提出的系统产生的低均等错误率(EER)和分段边界接近于从手动注释获得的边界,并且优于基于能量或基于熵的Birdong分段算法。在存在附加噪声(-10至10 dB SNR)的情况下,建议的短语检测系统不会像其他算法那样严重退化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号