...
首页> 外文期刊>Applied Acoustics >A novel spiral pattern and 2D M4 pooling based environmental sound classification method
【24h】

A novel spiral pattern and 2D M4 pooling based environmental sound classification method

机译:一种新型螺旋模式和基于2D M4的环境声分类方法

获取原文
获取原文并翻译 | 示例
           

摘要

One of the crucial problems of the signal processing, digital forensics and machine learning is the environmental sound classification (ESC). Several ESC methods have been presented to obtain highly accurate model. In this work, a novel multileveled ESC method is presented. The presented ESC method uses two novel algorithms namely Spiral Pattern and two dimensional maximum, minimum, median and mean (2D-M4) pooling. By using these methods (Spiral Pattern and 2D-M4 pooling), 9 level feature generation approach is presented. Since the proposed Spiral Pattern has nine arrows, it extracts 9 and 18 bits using signum and ternary functions respectively. As a result, 1536 features are extracted in each level and totally 15,360 features are generated using from 0th to 9th levels. In order to select the discriminative features, neighbourhood component analysis (NCA) is used and 700 most distinctive features are selected. In the classification phase, deep neural network is trained and tested with the ESC-10 and ESC-50 datasets. 98.75% and 85.75% average classification accuracies were achieved with 10-folds cross validation for ESC-10 and ESC-50 datasets respectively. The experimental results reveal that the proposed Spiral Pattern and 2D-M4 pooling based ESC method is superior than the human auditory system (HAS) for environmental sound classification. (C) 2020 Elsevier Ltd. All rights reserved.
机译:信号处理,数字取证和机器学习的关键问题之一是环境声音分类(ESC)。已经提出了几种ESC方法以获得高精度的模型。在这项工作中,提出了一种新颖的多级ESC方法。所呈现的ESC方法使用两种新颖的算法,即螺旋模式和二维最大,最小,中值和平均值(2D-M4)汇集。通过使用这些方法(螺旋模式和2D-M4池),提出了9个级别的特征生成方法。由于所提出的螺旋模式具有九个箭头,因此分别利用Signum和三元功能提取9和18位。结果,在每个级别中提取1536个特征,使用0到第9级生成完整的15,360个特征。为了选择鉴别特征,使用邻域分量分析(NCA),选择700个最独特的功能。在分类阶段,深神经网络培训并用ESC-10和ESC-50数据集进行培训和测试。对于ESC-10和ESC-50数据集的10倍交叉验证,实现了98.75%和85.75%的平均分类精度。实验结果表明,所提出的螺旋模式和基于2D-M4的ESC方法优于人类听觉系统(具有环境声音分类。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号