首页> 外文期刊>The Journal of the Acoustical Society of America >A robust automatic birdsong phrase classification: A template-based approach
【24h】

A robust automatic birdsong phrase classification: A template-based approach

机译:强大的自动鸟鸣短语分类:一种基于模板的方法

获取原文
获取原文并翻译 | 示例
       

摘要

Automatic phrase detection systems of bird sounds are useful in several applications as they reduce the need for manual annotations. However, birdphrase detection is challenging due to limited training data and background noise. Limited data occur because of limited recordings or the existence of rare phrases. Background noise interference occurs because of the intrinsic nature of the recording environment such as wind or other animals. This paper presents a different approach to birdsong phrase classification using template-based techniques suitable even for limited training data and noisy environments. The algorithm utilizes dynamic time-warping (DTW) and prominent (high-energy) time-frequency regions of training spectrograms to derive templates. The performance of the proposed algorithm is compared with the traditional DTW and hidden Markov models (HMMs) methods under several training and test conditions. DTW works well when the data are limited, while HMMs do better when more data are available, yet they both suffer when the background noise is severe. The proposed algorithm outperforms DTW and HMMs in most training and testing conditions, usually with a high margin when the background noise level is high. The innovation of this work is that the proposed algorithm is robust to both limited training data and background noise. (C) 2016 Acoustical Society of America.
机译:鸟声自动短语检测系统在多种应用中很有用,因为它们减少了手动注释的需要。但是,由于训练数据和背景噪声有限,因此鸟语短语检测具有挑战性。由于录音有限或稀有短语的存在,导致数据有限。由于诸如风或其他动物之类的记录环境的固有性质而产生背景噪声干扰。本文提出了一种使用基于模板的技术对鸟语短语进行分类的不同方法,该方法甚至适用于有限的训练数据和嘈杂的环境。该算法利用动态时间扭曲(DTW)和训练频谱图的突出(高能)时频区域来导出模板。在几种训练和测试条件下,将所提算法的性能与传统DTW和隐马尔可夫模型(HMM)方法进行了比较。当数据有限时,DTW效果很好,而当有更多数据可用时,HMM效果更好,但是当背景噪声严重时,它们都会受到影响。所提出的算法在大多数训练和测试条件下均优于DTW和HMM,通常在背景噪声水平较高时具有较高的余量。这项工作的创新之处在于,所提出的算法对于有限的训练数据和背景噪声均具有鲁棒性。 (C)2016年美国声学学会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号