Urban sound classification based on 2-order dense convolutional network using dual features

首页> 外文期刊>Applied Acoustics >Urban sound classification based on 2-order dense convolutional network using dual features

【24h】

Urban sound classification based on 2-order dense convolutional network using dual features

机译：基于双重特征的二阶密集卷积网络的城市声音分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Audio carry a large amount of life scenes and physical events in the city, therefore, developing deep learning approach to automatically extract this information has huge potential and application in building smart-city. In this paper, a novel urban sound event classification model based on 2-order dense convolutional network using dual features is proposed, which aims at the problems of insufficient classification accuracy and adaptability of current models. Firstly, the brief introduction of urban sound classification development and application is presented in Section 1. Then, the method of feature extraction and add noise environment is respectively introduced in Section 2. Moreover, a new network structure referred to as 2-order dense convolutional network (shorten as 2-DenseNet) and its algorithm are presented in Section 3. Meanwhile, an urban sound event classification model based on 2-DenseNet using dual features, i.e. D-2-DenseNet is proposed in this paper. Theoretically, D-2-DenseNet not only can accelerate the convergence speed when compared with DenseNet, but also can enhance the classification accuracy and guarantee a good generalization ability owing to the fact that dual features fusion is exploited in the proposed model. Finally, in order to validate advantages of the D-2-DenseNet, this new model is respectively exploited in the urban sound event classification based on UrbanSound8K and Dcase2016 datasets. The experimental result shows that the accuracy of the network is respectively 84.83% and 85.17%, which has increase up to 13.81% and 7.07% compared with baseline. Compared with single feature network, the classification accuracy of D-2-DenseNet has increased by 3.35% and 4.78% respectively in noise environment. (C) 2020 Elsevier Ltd. All rights reserved.

机译：音频在城市中承载着大量的生活场景和自然事件，因此，开发深度学习方法来自动提取这些信息具有巨大的潜力，并在建设智慧城市中具有巨大的潜力。针对现有分类模型的分类精度和适应性不足的问题，提出了一种基于双重特征的二阶密集卷积网络的城市声音事件分类模型。首先在第1节中简要介绍了城市声音分类的发展和应用，然后在第2节中分别介绍了特征提取和添加噪声环境的方法。此外，一种称为二阶密集卷积的新网络结构第三部分介绍了网络（简称为2-DenseNet）及其算法。同时，本文提出了一种基于2-DenseNet的具有双重特征的城市声音事件分类模型，即D-2-DenseNet。从理论上讲，D-2-DenseNet与DenseNet相比，不仅可以加快收敛速度，而且由于在模型中采用了双重特征融合，因此可以提高分类的准确性，并保证良好的泛化能力。最后，为了验证D-2-DenseNet的优势，在基于UrbanSound8K和Dcase2016数据集的城市声音事件分类中分别使用了该新模型。实验结果表明，该网络的准确度分别为84.83％和85.17％，与基线相比提高了13.81％和7.07％。与单特征网络相比，在噪声环境下，D-2-DenseNet的分类精度分别提高了3.35％和4.78％。（C）2020 Elsevier Ltd.保留所有权利。

著录项

来源
《Applied Acoustics》 |2020年第7期|107243.1-107243.9|共9页
作者

展开▼
作者单位

Jiangnan Univ Sch Mech Engn Wuxi 214122 Jiangsu Peoples R China|Jiangsu Key Lab Adv Food Mfg Equipment & Technol Wuxi 214122 Jiangsu Peoples R China;

Suzhou Vocat Inst Ind Technol Suzhou 215104 Jiangsu Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Urban sound classification; 2-DenseNet; Dual features fusion; D-2-DenseNet;

机译：城市声音分类;2-密集网;双重功能融合;D-2-密集网;

相似文献

外文文献
中文文献
专利

1. Heart sound classification based on log Mel-frequency spectral coefficients features and convolutional neural networks [J] . Kui Haoran, Pan Jiahua, Zong Rong, Biomedical signal processing and control . 2021,第Auga期

机译：基于日志熔体频谱系数特征和卷积神经网络的心声分类
2. Hyperspectral remote sensing image classification based on dense residual three-dimensional convolutional neural network [J] . Suting Chen, Meng Jin, Jie Ding Multimedia Tools and Applications . 2021,第2期

机译：基于密集残余三维卷积神经网络的高光谱遥感图像分类
3. Human activity classification based on sound recognition and residual convolutional neural network [J] . Automation in construction . 2020,第Juna期

机译：基于声音识别和残差卷积神经网络的人类活动分类
4. Urban Sound Classification Using Convolutional Neural Network and Long Short Term Memory Based on Multiple Features [C] . Joy Krishan Das, Arka Ghosh, Abhijit Kumar Pal, International Conference On Intelligent Computing in Data Sciences . 2020

机译：城市声音分类使用卷积神经网络和基于多个功能的长期内存
5. Investigation of Convolutional Neural Network Architectures for Image-based Feature Learning and Classification. [D] . Ren, Johnny. 2016

机译：基于图像的特征学习和分类的卷积神经网络体系结构研究。
6. Skin Lesion Classification Using Densely Connected Convolutional Networks with Attention Residual Learning [O] . Jing Wu, Wei Hu, Yuan Wen, 2020

机译：皮肤病变分类使用密集连接的卷积网络引起剩余学习
7. Densely Feature Fusion Based on Convolutional Neural Networks for Motor Imagery EEG Classification [O] . Donglin Li, Jianhui Wang, Jiacan Xu, 2019

机译：基于卷积神经网络的电机图像EEG分类密集融合
8. Keypoint Density-Based Region Proposal for Fine-Grained Object Detection and Classification Using Regions with Convolutional Neural Network Features. [R] . Turner, J. T., Gupta, K., Morris, B., 2015

机译：基于关键点密度的区域提议，用于使用具有卷积神经网络特征的区域进行细粒度目标检测和分类。

Urban sound classification based on 2-order dense convolutional network using dual features

摘要

著录项

相似文献

相关主题

期刊订阅