首页> 外文期刊>Applied Acoustics >Multi-scale semantic feature fusion and data augmentation for acoustic scene classification
【24h】

Multi-scale semantic feature fusion and data augmentation for acoustic scene classification

机译:多尺度语义特征融合和数据扩充用于声场分类

获取原文
获取原文并翻译 | 示例
       

摘要

This paper investigates a multi-scale semantic feature fusion and data augmentation approach for deep convolutional neural network (CNN) based acoustic scene classification. To ensemble the multi-scale semantic information of CNN and improve the performance of acoustic scene classification, a multi scale feature fusion framework, which consists of a simplified Xception backbone and a semantic feature fusion strategy, is presented. A novel label smoothing mixup data augmentation method, which is a generalization of mixup and label smoothing, is proposed to alleviate the over-confident problem of network training. A spatial-mixup technique is presented to generate meaningful mixup virtual data for acoustic scene classification. Extensive experiments on synthetic data and real acoustic scene classification dataset demonstrate that both multi-scale semantic feature fusion and label smoothing spatial-mixup data augmentation are effective for improving the acoustic scene classification performance of a deep neural network. (C) 2020 Elsevier Ltd. All rights reserved.
机译:本文研究了基于深度卷积神经网络(CNN)的声学场景分类的多尺度语义特征融合和数据增强方法。为了融合CNN的多尺度语义信息并提高声学场景分类的性能,提出了一种由简化的Xception主干和语义特征融合策略组成的多尺度特征融合框架。提出了一种新的标签平滑混合数据扩充方法,它是对混合和标签平滑的一种概括,旨在缓解网络训练中过于自信的问题。提出了一种空间混合技术来生成有意义的混合虚拟数据,以进行声学场景分类。对合成数据和真实声学场景分类数据集进行的大量实验表明,多尺度语义特征融合和标签平滑空间混合数据扩充都可以有效地改善深度神经网络的声学场景分类性能。 (C)2020 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号