Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

首页> 外文期刊>Applied Acoustics >Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

【24h】

Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

机译：多尺度语义特征融合和数据扩充用于声场分类

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper investigates a multi-scale semantic feature fusion and data augmentation approach for deep convolutional neural network (CNN) based acoustic scene classification. To ensemble the multi-scale semantic information of CNN and improve the performance of acoustic scene classification, a multi scale feature fusion framework, which consists of a simplified Xception backbone and a semantic feature fusion strategy, is presented. A novel label smoothing mixup data augmentation method, which is a generalization of mixup and label smoothing, is proposed to alleviate the over-confident problem of network training. A spatial-mixup technique is presented to generate meaningful mixup virtual data for acoustic scene classification. Extensive experiments on synthetic data and real acoustic scene classification dataset demonstrate that both multi-scale semantic feature fusion and label smoothing spatial-mixup data augmentation are effective for improving the acoustic scene classification performance of a deep neural network. (C) 2020 Elsevier Ltd. All rights reserved.

机译：本文研究了基于深度卷积神经网络（CNN）的声学场景分类的多尺度语义特征融合和数据增强方法。为了融合CNN的多尺度语义信息并提高声学场景分类的性能，提出了一种由简化的Xception主干和语义特征融合策略组成的多尺度特征融合框架。提出了一种新的标签平滑混合数据扩充方法，它是对混合和标签平滑的一种概括，旨在缓解网络训练中过于自信的问题。提出了一种空间混合技术来生成有意义的混合虚拟数据，以进行声学场景分类。对合成数据和真实声学场景分类数据集进行的大量实验表明，多尺度语义特征融合和标签平滑空间混合数据扩充都可以有效地改善深度神经网络的声学场景分类性能。（C）2020 Elsevier Ltd.保留所有权利。

著录项

来源
《Applied Acoustics》 |2020年第6期|107238.1-107238.10|共10页
作者

展开▼
作者单位

Chongqing Univ MOE Key Lab Optoelect Technol & Syst Chongqing 400044 Peoples R China;

Chongqing Univ Sci & Technol Sch Elect Engn Chongqing 401331 Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Multi-scale feature learning; Convolutional neural networks; Data augmentation; Acoustic scene classification (ASC); Machine listening;

机译：多尺度特征学习;卷积神经网络;数据扩充;声场分类（ASC）;机器听;
入库时间 2022-08-18 05:19:01

相似文献

外文文献
中文文献
专利

1. Scene classification of remote sensing image based on deep network and multi-scale features fusion [J] . Yang Zhou, Mu Xiao-dong, Zhao Feng-an Optik: Zeitschrift fur Licht- und Elektronenoptik: = Journal for Light-and Electronoptic . 2018,第期

机译：基于深网络和多尺度特征融合的遥感图像场景分类
2. LDA-based data augmentation algorithm for acoustic scene classification [J] . Leng Yan, Zhao Weiwei, Lin Chan, Knowledge-Based Systems . 2020,第May11期

机译：基于LDA的声学场景分类数据增强算法
3. Semantic Classification of Heterogeneous Urban Scenes Using Intrascene Feature Similarity and Interscene Semantic Dependency [J] . Zhang Xiuyuan, Du Shihong, Wang Yi-Chen Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of . 2015,第5期

机译：基于场景内特征相似度和场景间语义相关性的异构城市场景语义分类
4. Acoustic scene classification using convolutional neural networks and multi-scale multi-feature extraction [C] . An Dang, Toan H. Vu, Jia-Ching Wang IEEE International Conference on Consumer Electronics . 2018

机译：使用卷积神经网络和多尺度多特征提取进行声音场景分类
5. Feature Extraction and Fusion for Supervised and Semi-supervised Classification: Application to fMRI and LTM Data. [D] . Du, Wei. 2014

机译：监督和半监督分类的特征提取和融合：应用于fMRI和LTM数据。
6. Multi-Scale Spatial Concatenations of Local Features in Natural Scenes and Scene Classification [O] . Xiaoyuan Zhu, Zhiyong Yang -1

机译：自然场景和场景分类中局部特征的多尺度空间级联
7. Bayesian fusion of camera metadata cues in semantic scene classification [O] . Matthew Boutell, Jiebo Luo 2004

机译：语义场景分类中相机元数据线索的贝叶斯融合

Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

摘要

著录项

相似文献

相关主题

期刊订阅