...
首页> 外文期刊>Applied Acoustics >Late fusion framework for Acoustic Scene Classification using LPCC, SCMC, and log-Mel band energies with Deep Neural Networks
【24h】

Late fusion framework for Acoustic Scene Classification using LPCC, SCMC, and log-Mel band energies with Deep Neural Networks

机译:使用LPCC,SCMC和LOG-MEL频段能量与深神经网络的后期融合框架进行声学场景分类

获取原文
获取原文并翻译 | 示例
           

摘要

A major problem in Acoustic Scene Classification (ASC) is a representation of an acoustic scene, which serves to be an important task for ASC. This study used Linear Prediction Cepstral Coefficients (LPCC) and Spectral Centroid Magnitude Cepstral Coefficients (SCMC) features along with log-Mel band energies for the representation of an acoustic scene. Deep Neural Networks (DNN) is being used to model the Acoustic Scene Classification (ASC). LPCCs are used to capture the changes in the auditory spectrum with time and SCMCs are used to capture the weighted average magnitude finely for a given acoustic scene subband. log-Mel band energies are used to capture the spectral envelopes of audio frame. The DNN architecture is used for audio track level classification. We have experimented on Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 development dataset and DCASE 2017 dataset. We carried out experiments with individual feature sets, and also performed decision level DNN score fusions for improving the performance. (C) 2020 Elsevier Ltd. All rights reserved.
机译:声场分类(ASC)中的一个主要问题是声学场景的表示,其是ASC的重要任务。该研究使用了线性预测谱系数(LPCC)和光谱质心谱系数(SCMC)特征以及用于声学场景的表示的Log-Mel频带能量。深神经网络(DNN)用于模拟声学场景分类(ASC)。 LPCC用于捕获随着时间的推移频谱的变化,并且SCCS用于捕获给定的声学场景子带的精细捕获加权平均幅度。 Log-Mel频带能量用于捕获音频帧的光谱信封。 DNN架构用于音频跟踪级别分类。我们已经尝试检测和分类声学场景和事件(DCASE)2018开发数据集和DCEAD 2017数据集。我们对各个特征集进行了实验,并且还执行了决策级别DNN评分融合以提高性能。 (c)2020 elestvier有限公司保留所有权利。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号