...
首页> 外文期刊>Ecological informatics: an international journal on ecoinformatics and computational ecology >Handcrafted features and late fusion with deep learning for bird sound classification
【24h】

Handcrafted features and late fusion with deep learning for bird sound classification

机译:手工制作的功能和晚期融合,深入学习鸟类分类

获取原文
获取原文并翻译 | 示例

摘要

Automated classification of calling bird species is useful for large-scale temporal and spatial environmental monitoring. In this paper, we investigate acoustic features, visual features, and deep learning for bird sound classification. For the deep learning approach, the Convolutional Neural Network layers are used for learning generalized features and dimension reduction, while a conventional fully connected layer is used for classification. Then, an unified end-to-end model is built by combing those three layers for classifying calling bird species. For visual and acoustic features, two traditional classifiers are compared to classify the bird sounds. Experimental results on 14 bird species indicate that our proposed deep learning method can achieve the best F1-score 94.36%, which is higher than using the acoustic features approach (88.97%) and using the visual features approach (88.87%). To further improve the classification performance, a class-based late fusion method is explored. Our final best classification F1-score is 95.95%, which is obtained by the late fusion of the acoustic features approach, the visual features approach, and deep learning.
机译:呼叫鸟类的自动分类对于大规模的时间和空间环境监测有用。在本文中,我们调查了声音分类的声学特征,视觉功能和深度学习。对于深度学习方法,卷积神经网络层用于学习广义特征和尺寸减小,而传统的完全连接层用于分类。然后,通过梳理那些用于分类调用鸟类的三层来构建统一的端到端模型。对于视觉和声学特征,比较两个传统分类器,以分类鸟类声音。 14只鸟物种的实验结果表明,我们提出的深度学习方法可以实现最佳F1分数94.36%,其高于使用声学特征方法(88.97%)并使用视觉特征方法(88.87%)。为了进一步提高分类性能,探讨了基于类的后期融合方法。我们最终的最佳分类F1分数为95.95%,由声学特征方法的后期融合,视觉特征方法和深度学习获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号