Late fusion framework for Acoustic Scene Classification using LPCC, SCMC, and log-Mel band energies with Deep Neural Networks

Paseddula Chandrasekhar; Gangashetty Suryakanth V


Abstract

A central problem in Acoustic Scene Classification (ASC) is how to represent an acoustic scene. This study uses Linear Prediction Cepstral Coefficients (LPCC) and Spectral Centroid Magnitude Cepstral Coefficients (SCMC) features, along with log-Mel band energies, to represent an acoustic scene. Deep Neural Networks (DNN) are used to model the acoustic scenes. LPCCs capture the changes in the auditory spectrum over time, SCMCs capture the weighted average magnitude of each subband of a given acoustic scene in fine detail, and log-Mel band energies capture the spectral envelope of each audio frame. The DNN architecture is used for audio-track-level classification. We experimented on the Detection and Classification of Acoustic Scenes and Events (DCASE) 2018 development dataset and the DCASE 2017 dataset. We carried out experiments with individual feature sets and also performed decision-level fusion of the DNN scores to improve performance. (C) 2020 Elsevier Ltd. All rights reserved.
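
The decision-level score fusion mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the fusion rule, so the weighted average of class posteriors used here is an assumption, as are the function names `track_scores` and `late_fusion` and the toy score values. Each feature stream (LPCC, SCMC, log-Mel) is assumed to have its own DNN that outputs per-frame class logits, which are averaged into one score vector per audio track before fusion.

```python
import numpy as np


def softmax(x, axis=-1):
    """Numerically stable softmax along the given axis."""
    z = x - x.max(axis=axis, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=axis, keepdims=True)


def track_scores(frame_logits):
    """Average per-frame class posteriors into one score vector per track.

    frame_logits: array of shape (n_frames, n_classes) from one DNN.
    Averaging posteriors over frames is an assumption, not the paper's
    confirmed pooling rule.
    """
    return softmax(np.asarray(frame_logits, dtype=float)).mean(axis=0)


def late_fusion(score_list, weights=None):
    """Fuse track-level score vectors from several models.

    score_list: list of arrays, each of shape (n_classes,).
    weights: optional per-model weights; equal weights if omitted.
    Returns the fused score vector and the predicted class index.
    """
    scores = np.stack([np.asarray(s, dtype=float) for s in score_list])
    if weights is None:
        weights = np.ones(len(score_list))
    weights = np.asarray(weights, dtype=float)
    weights = weights / weights.sum()  # normalise so weights sum to 1
    fused = np.tensordot(weights, scores, axes=1)  # shape (n_classes,)
    return fused, int(np.argmax(fused))


# Toy track-level posteriors for three scene classes (hypothetical values).
lpcc_scores = np.array([0.5, 0.3, 0.2])
scmc_scores = np.array([0.2, 0.5, 0.3])
logmel_scores = np.array([0.1, 0.6, 0.3])

fused, label = late_fusion([lpcc_scores, scmc_scores, logmel_scores])
print(fused, label)
```

With equal weights, the fused decision follows the class that the streams jointly favour, even when one stream alone prefers a different class. Per-model weights could be tuned on a validation split if one stream is more reliable than the others.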


Bibliographic record

Source
Applied Acoustics | 2021, Issue 1 | 107568.1-107568.12 | 12 pages
Authors
Paseddula Chandrasekhar; Gangashetty Suryakanth V
Author affiliations
Int Inst Informat Technol Hyderabad, Speech Proc Lab, Hyderabad, India (listed for both authors)
Indexed in
Science Citation Index (SCI); Engineering Index (EI)
Original format
PDF
Language
English
Keywords
Linear Prediction Cepstral Coefficients; Spectral Centroid Magnitude Cepstral Coefficients; log-Mel band energies; Acoustic Scene Classification; Deep Neural Networks

