A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification

Chu W.; Champagne B.

首页> 外文期刊>IEEE transactions on audio, speech and language processing >A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification

【24h】

A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification

机译：基于噪声稳健FFT的听觉谱及其在音频分类中的应用

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

In this paper, we investigate the noise robustness of Wang and Shamma''s early auditory (EA) model for the calculation of an auditory spectrum in audio classification applications. First, a stochastic analysis is conducted wherein an approximate expression of the auditory spectrum is derived to justify the noise-suppression property of the EA model. Second, we present an efficient fast Fourier transform (FFT)-based implementation for the calculation of a noise-robust auditory spectrum, which allows flexibility in the extraction of audio features. To evaluate the performance of the proposed FFT-based auditory spectrum, a set of speech/music/noise classification tasks is carried out wherein a support vector machine (SVM) algorithm and a decision tree learning algorithm (C4.5) are used as the classifiers. Features used for classification include conventional Mel-frequency cepstral coefficients (MFCCs), MFCC-like features obtained from the original auditory spectrum (i.e., based on the EA model) and the proposed FFT-based auditory spectrum, as well as spectral features (spectral centroid, bandwidth, etc.) computed from the latter. Compared to the conventional MFCC features, both the MFCC-like and spectral features derived from the proposed FFT-based auditory spectrum show more robust performance in noisy test cases. Test results also indicate that, using the new MFCC-like features, the performance of the proposed FFT-based auditory spectrum is slightly better than that of the original auditory spectrum, while its computational complexity is reduced by an order of magnitude.

机译：在本文中，我们研究了Wang和Shamma的早期听觉（EA）模型的噪声鲁棒性，用于计算音频分类应用中的听觉频谱。首先，进行随机分析，其中导出听觉频谱的近似表达式以证明EA模型的噪声抑制特性是正确的。其次，我们提出了一种基于高效快速傅立叶变换（FFT）的实现，用于计算噪声健壮的听觉频谱，从而可以灵活地提取音频特征。为了评估建议的基于FFT的听觉频谱的性能，执行了一组语音/音乐/噪声分类任务，其中支持向量机（SVM）算法和决策树学习算法（C4.5）被用作分类器。用于分类的特征包括常规的梅尔频率倒谱系数（MFCC），从原始听觉频谱（即，基于EA模型）和拟议的基于FFT的听觉频谱以及频谱特征（频谱质心，带宽等）。与常规MFCC功能相比，从拟议的基于FFT的听觉频谱中获得的类MFCC和频谱特征在嘈杂的测试案例中均显示出更强大的性能。测试结果还表明，使用类似于MFCC的新功能，建议的基于FFT的听觉频谱的性能比原始听觉频谱的性能稍好，同时其计算复杂度降低了一个数量级。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2008年第1期|p.137-150|共14页
作者
Chu W.; Champagne B.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
Audio classification; C4.5; early auditory (EA) model; noise suppression; self-normalization; support vector machine (SVM);

机译：音频分类;C4.5;早期听觉（EA）模型;噪声抑制;自归一化;支持向量机（SVM）;

相似文献

外文文献
中文文献
专利

1. FFT-Based Data Hiding on Audio in LWT-Domain Using Spread Spectrum Technique [J] . Budiman Gelar, Suksmono Andriyan Bayu, Danudirdjo Donny Elektronika ir Elektrotechnika . 2020,第3期

机译：基于FFT的数据在LWT域中掩藏了光谱技术的音频
2. A simplified early auditory model with application in audio classification [J] . Wei Chu, Benoit Champagne Canadian journal of electrical and computer engineering . 2006,第4期

机译：简化的早期听觉模型及其在音频分类中的应用
3. Multiscale 2-D Singular Spectrum Analysis and Principal Component Analysis for Spatial–Spectral Noise-Robust Feature Extraction and Classification of Hyperspectral Images [J] . Ping Ma, Jinchang Ren, Huimin Zhao, Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of . 2021,第1期

机译：多尺度2-D奇异谱分析和空间谱稳健功能提取和超光谱分类的主成分分析
4. Further Studies of a FFT-Based Auditory Spectrum with Application in Audio Classification [C] . Wei Chu, Beno(i)t Champagne 9th International Conference on Signal Processing(第九届国际信号处理学术会议)（ICSP'08）论文集 . 2008

机译：基于FFT的听觉频谱的进一步研究及其在音频分类中的应用
5. Auditory-Based Noise-Robust Audio Classification Algorithms. [D] . Chu, Wei. 2008

机译：基于听觉的强噪声音频分类算法。
6. Automatic Detection and Classification of Audio Events for Road Surveillance Applications [O] . Noor Almaadeed, Muhammad Asim, Somaya Al-Maadeed, 2018

机译：用于道路监控应用的音频事件的自动检测和分类
7. A noise-robust FFT-based spectrum for audio classification [O] . Wei Chu, Benoît Champagne 2006

机译：用于音频分类的基于FFT的噪声稳健频谱

A Noise-Robust FFT-Based Auditory Spectrum With Application in Audio Classification

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅