首页> 外文会议>International Conference on Advanced Computing and Communication Systems >Scattering Transform of Averaged Data Augmentation for Ensemble Random Subspace Discriminant Classifiers in Audio Recognition
【24h】

Scattering Transform of Averaged Data Augmentation for Ensemble Random Subspace Discriminant Classifiers in Audio Recognition

机译:散射变换对音频识别中的集合随机子空间判别分类器的平均数据增强

获取原文

摘要

The paper presents an audio-based context recognition system using ensemble classifiers with wavelet feature extraction. The device-wise classification accuracy can be achieved with minimum computational time and improvement in classification accuracy compared to convolutional neural network (CNN) based technique via a subspace discriminant classifier. The proposed framework involves wavelet scattering feature extraction from the TAU2020 Mobile dataset via a modified data augmentation that improves the classification of unseen recording devices during training. The random sub-space discriminant classifier's device-wise classification performance on the extracted features is compared with other ensembles such as the bagged trees ensemble, boosted trees ensemble, and multiclass Naive Bayes classifier, and k-nearest neighbors. The evaluation results show around 72.6% accuracy and averaged 14.4% better than the ensemble methods. Compared to the highest performance of 76.5% in the DCASE2020 Challenge (Snapshot ensemble deep neural networks), the proposed approach offers a shorter computational time than CNN.
机译:本文介绍了一种基于音频的上下文识别系统,使用具有小波特征提取的集合分类器。与基于卷积神经网络(CNN)的技术相比,可以通过最小计算时间和分类精度的提高来实现设备明智的分类精度,通过子空间判别分类器。所提出的框架涉及通过修改的数据增强从Tau2020移动数据集提取小波散射特征提取,该数据增强改善了在训练期间的看不见的记录设备的分类。随机的子空间判别分类器的设备明智的分类性能与提取的特征相比,与袋装树集合,升级的树集合和多款幼稚贝叶斯分类器和k最近邻居等集合进行了比较。评估结果表明约72.6%的精度,比集合方法平均为14.4%。与DCES2020挑战(快照集合深神经网络)中的最高性能相比,76.5%,所提出的方法提供比CNN更短的计算时间。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号