...
首页> 外文期刊>Ecological informatics: an international journal on ecoinformatics and computational ecology >Data augmentation approaches for improving animal audio classification
【24h】

Data augmentation approaches for improving animal audio classification

机译:改善动物音频分类的数据增强方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

In this paper we present ensembles of classifiers for automated animal audio classification, exploiting different data augmentation techniques for training Convolutional Neural Networks (CNNs). The specific animal audio classification problems are i) birds and ii) cat sounds, whose datasets are freely available. We train five different CNNs on the original datasets and on their versions augmented by four augmentation protocols, working on the raw audio signals or their representations as spectrograms. We compared our best approaches with the state of the art, showing that we obtain the best recognition rate on the same datasets, without ad hoc parameter optimization. Our study shows that different CNNs can be trained for the purpose of animal audio classification and that their fusion works better than the stand-alone classifiers. To the best of our knowledge this is the largest study on data augmentation for CNNs in animal audio classification audio datasets using the same set of classifiers and parameters. Our MATLAB code is available at https://github.com/LorisNanni.
机译:在本文中,我们向自动化动物音频分类提供了分类器的集成,利用用于训练卷积神经网络(CNN)的不同数据增强技术。特定的动物音频分类问题是i)鸟类和ii)猫的声音,其数据集自由可用。我们在原始数据集上培训五个不同的CNN,并在其版本上由四个增强协议增强,在原始音频信号或其表示作为频谱图。我们将我们的最佳方法与现有技术进行了比较,表明我们在没有临时参数优化的情况下获得相同数据集的最佳识别率。我们的研究表明,可以针对动物音频分类的目的培训不同的CNN,并且其融合优于独立的分类器。据我们所知,这是使用相同的分类器和参数的动物音频分类音频数据集中的CNNS数据增强的最大研究。我们的matlab代码可在https://github.com/lorisnanni获得。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号