首页> 外文期刊>Applied Acoustics >Urban sound classification based on 2-order dense convolutional network using dual features
【24h】

Urban sound classification based on 2-order dense convolutional network using dual features

机译:基于双重特征的二阶密集卷积网络的城市声音分类

获取原文
获取原文并翻译 | 示例
           

摘要

Audio carry a large amount of life scenes and physical events in the city, therefore, developing deep learning approach to automatically extract this information has huge potential and application in building smart-city. In this paper, a novel urban sound event classification model based on 2-order dense convolutional network using dual features is proposed, which aims at the problems of insufficient classification accuracy and adaptability of current models. Firstly, the brief introduction of urban sound classification development and application is presented in Section 1. Then, the method of feature extraction and add noise environment is respectively introduced in Section 2. Moreover, a new network structure referred to as 2-order dense convolutional network (shorten as 2-DenseNet) and its algorithm are presented in Section 3. Meanwhile, an urban sound event classification model based on 2-DenseNet using dual features, i.e. D-2-DenseNet is proposed in this paper. Theoretically, D-2-DenseNet not only can accelerate the convergence speed when compared with DenseNet, but also can enhance the classification accuracy and guarantee a good generalization ability owing to the fact that dual features fusion is exploited in the proposed model. Finally, in order to validate advantages of the D-2-DenseNet, this new model is respectively exploited in the urban sound event classification based on UrbanSound8K and Dcase2016 datasets. The experimental result shows that the accuracy of the network is respectively 84.83% and 85.17%, which has increase up to 13.81% and 7.07% compared with baseline. Compared with single feature network, the classification accuracy of D-2-DenseNet has increased by 3.35% and 4.78% respectively in noise environment. (C) 2020 Elsevier Ltd. All rights reserved.
机译:音频在城市中承载着大量的生活场景和自然事件,因此,开发深度学习方法来自动提取这些信息具有巨大的潜力,并在建设智慧城市中具有巨大的潜力。针对现有分类模型的分类精度和适应性不足的问题,提出了一种基于双重特征的二阶密集卷积网络的城市声音事件分类模型。首先在第1节中简要介绍了城市声音分类的发展和应用,然后在第2节中分别介绍了特征提取和添加噪声环境的方法。此外,一种称为二阶密集卷积的新网络结构第三部分介绍了网络(简称为2-DenseNet)及其算法。同时,本文提出了一种基于2-DenseNet的具有双重特征的城市声音事件分类模型,即D-2-DenseNet。从理论上讲,D-2-DenseNet与DenseNet相比,不仅可以加快收敛速度​​,而且由于在模型中采用了双重特征融合,因此可以提高分类的准确性,并保证良好的泛化能力。最后,为了验证D-2-DenseNet的优势,在基于UrbanSound8K和Dcase2016数据集的城市声音事件分类中分别使用了该新模型。实验结果表明,该网络的准确度分别为84.83%和85.17%,与基线相比提高了13.81%和7.07%。与单特征网络相比,在噪声环境下,D-2-DenseNet的分类精度分别提高了3.35%和4.78%。 (C)2020 Elsevier Ltd.保留所有权利。

著录项

  • 来源
    《Applied Acoustics》 |2020年第7期|107243.1-107243.9|共9页
  • 作者

  • 作者单位

    Jiangnan Univ Sch Mech Engn Wuxi 214122 Jiangsu Peoples R China|Jiangsu Key Lab Adv Food Mfg Equipment & Technol Wuxi 214122 Jiangsu Peoples R China;

    Suzhou Vocat Inst Ind Technol Suzhou 215104 Jiangsu Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Urban sound classification; 2-DenseNet; Dual features fusion; D-2-DenseNet;

    机译:城市声音分类;2-密集网;双重功能融合;D-2-密集网;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号