...
首页> 外文期刊>Expert Systems with Application >Investigation of acoustic and visual features for acoustic scene classification
【24h】

Investigation of acoustic and visual features for acoustic scene classification

机译:声学和视觉特征的声学场景分类研究

获取原文
获取原文并翻译 | 示例
           

摘要

Acoustic scene classification has gained great interests in recent years due to its diverse applications. Various acoustic and visual features have been proposed and evaluated. However, few studies have investigated acoustic and visual feature aggregation for acoustic scene classification. In this paper, we investigated various feature sets based on the fusion of acoustic and visual features. Specifically, acoustic features are directly extracted from the waveform: spectral centroid, spectral entropy, spectral flux, spectral roll-off, short-time energy, zero-crossing rate, and Mel-frequency Cepstral coefficients. For visual features, we calculate local binary pattern, histogram of gradients, and moments based on the audio scene time-frequency representation. Then, three feature selection algorithms are applied to various feature sets to reduce feature dimensionality: correlation-based feature selection, principal component analysis, and ReliefF. Experimental results show that our proposed system was able to achieve an accuracy improvement of 15.43% compared to the baseline system with the development set. When all development sets are used for training, the performance based on the evaluation set provided by the TUT Acoustic scene 2016 challenge is 87.44%, which is the fourth best among all non-neural network systems. (C) 2019 Elsevier Ltd. All rights reserved.
机译:近年来,声场分类由于其多样化的应用而引起了人们的极大兴趣。已经提出并评估了各种声学和视觉特征。但是,很少有研究针对声学场景分类研究声学和视觉特征聚合。在本文中,我们研究了基于声学和视觉特征融合的各种特征集。具体而言,直接从波形中提取声学特征:频谱质心,频谱熵,频谱通量,频谱滚降,短时能量,过零率和梅尔频率倒谱系数。对于视觉功能,我们根据音频场景的时频表示来计算局部二进制模式,梯度直方图和矩。然后,将三种特征选择算法应用于各种特征集以减少特征维数:基于相关性的特征选择,主成分分析和ReliefF。实验结果表明,与带有开发套件的基准系统相比,我们提出的系统能够将精度提高15.43%。当所有开发集都用于培训时,基于TUT Acoustic场景2016挑战提供的评估集的性能为87.44%,在所有非神经网络系统中排名第四。 (C)2019 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号