A Two-Stage Approach to Device-Robust Acoustic Scene Classification



Abstract

To improve device robustness, a highly desirable feature of a competitive data-driven acoustic scene classification (ASC) system, a novel two-stage system based on fully convolutional neural networks (CNNs) is proposed. Our two-stage system leverages an ad-hoc score combination of two CNN classifiers: (i) the first CNN classifies acoustic inputs into one of three broad classes, and (ii) the second CNN classifies the same inputs into one of ten finer-grained classes. Three different CNN architectures are explored to implement the two-stage classifiers, and a frequency sub-sampling scheme is investigated. Moreover, novel data augmentation schemes for ASC are also investigated. Evaluated on DCASE 2020 Task 1a, our results show that the proposed ASC system attains state-of-the-art accuracy on the development set, where our best system, a two-stage fusion of CNN ensembles, delivers an 81.9% average accuracy across multi-device test data and obtains a significant improvement on unseen devices. Finally, neural saliency analysis with class activation mapping (CAM) gives new insights into the patterns learnt by our models.
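
The abstract does not spell out the score combination, so the following is only a minimal sketch of one plausible scheme: each ten-class score is weighted by the score of the broad class it belongs to, and the highest fused score wins. The function names `fuse_scores` and `predict`, the product rule itself, and the grouping of the ten DCASE 2020 Task 1a scenes into indoor, outdoor, and transportation are assumptions made for illustration, not details taken from this record.

```python
# Hypothetical sketch of a two-stage score combination (not the authors' code).

# Assumed grouping of the ten scene labels into three broad classes.
FINE_TO_BROAD = {
    "airport": "indoor",
    "shopping_mall": "indoor",
    "metro_station": "indoor",
    "street_pedestrian": "outdoor",
    "public_square": "outdoor",
    "street_traffic": "outdoor",
    "park": "outdoor",
    "tram": "transportation",
    "bus": "transportation",
    "metro": "transportation",
}


def fuse_scores(broad_probs, fine_probs):
    """Weight each fine-class score by the score of its parent broad class."""
    fused = {
        scene: p * broad_probs[FINE_TO_BROAD[scene]]
        for scene, p in fine_probs.items()
    }
    total = sum(fused.values())
    # Renormalise so the fused scores sum to one (skipped if all are zero).
    if total > 0:
        fused = {scene: s / total for scene, s in fused.items()}
    return fused


def predict(broad_probs, fine_probs):
    """Return the scene label with the highest fused score."""
    fused = fuse_scores(broad_probs, fine_probs)
    return max(fused, key=fused.get)


# Made-up posteriors: the ten-class model slightly prefers "metro_station",
# while the three-class model is confident the clip is "transportation".
broad = {"indoor": 0.15, "outdoor": 0.05, "transportation": 0.80}
fine = {scene: 0.0 for scene in FINE_TO_BROAD}
fine.update(metro_station=0.40, metro=0.35, tram=0.15, bus=0.10)

print(predict(broad, fine))  # the broad-class score tips the decision to "metro"
```

In this toy input the coarse classifier overrides a near-tie in the fine classifier, which is the kind of correction a two-stage design is meant to provide; the real system may combine scores differently.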


Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2021 | pp. 845-849 | 5 pages
Authors
Hu Hu; Chao-Han Huck Yang; Xianjun Xia; Xue Bai; Xin Tang; Yajian Wang; Shutong Niu; Li Chai; Juanjuan Li; Hongning Zhu; Feng Bao; Yuanjun Zhao; Sabato Marco Siniscalchi; Yannan Wang; Jun Du; Chin-Hui Lee;
Original format: PDF
Keywords
Analytical models; Image analysis; Conferences; Signal processing; Acoustics; Robustness; Data models;


Similar literature


1. Scene classification using multiple features in a two-stage probabilistic classification framework [J]. Zhan-Li Sun, Deepu Rajan, Liang-Tien Chia. Neurocomputing, 2010, issues 16-18.
2. Investigation of acoustic and visual features for acoustic scene classification [J]. Xie Jie, Zhu Mingying. Expert Systems with Applications, 2019, Jul. issue.
3. Aerial scene classification via a two-stage voting fusion strategy [J]. Zhang Ying, Li Qingwu, Zhou Yaqin. Journal of Electronic Imaging, 2019, issue 5.
4. Two-Stage Classification Learning for Open Set Acoustic Scene Classification [C]. Chunxia Ren, Shengchen Li. Conference on Sound and Music Technology, 2020.
5. A real-time neural-net computing approach to the detection and classification of underwater acoustic transients [D]. Hemminger, Thomas Lee. 1992.
6. A two-stage classification approach identifies seven susceptibility genes for a simulated complex disease [O]. Nathan Pankratz. 2007.
7. CAA-Net: Conditional Atrous CNNs with Attention for Explainable Device-robust Acoustic Scene Classification [O]. Zhao Ren, Qiuqiang Kong, Jing Han. 2020.
8. Bayesian Approach to Acoustic Imaging and Object Classification by High Frequency Sonar [R]. Kelly, J. G., Carpenter, R. N. 1989.
