Real-World Speech/Non-Speech Audio Classification Based on Sparse Representation Features and GPCs



Abstract

A novel and robust approach to content-based speech/non-speech audio classification is proposed, based on sparse representation (SR) features and Gaussian process classifiers (GPCs). The projections of the noise-robust sparse representations of audio signals, computed by L_1-norm minimization, are used as features. GPCs are used to learn and predict audio categories. Whereas the hyperparameters of Support Vector Machines (SVMs) are difficult to determine, GPCs estimate theirs with a Bayesian selection criterion. Experimental results on real-world audio datasets show that the SR features are more robust to audio variants than mel-frequency cepstral coefficients (MFCCs) and that the proposed approach performs better than SVMs.
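
To illustrate the L_1-norm minimization step named in the abstract, here is a minimal sketch and not the authors' implementation. It assumes a LASSO-style objective solved with the iterative shrinkage-thresholding algorithm (ISTA), which is a swapped-in, generic solver. The random dictionary `D`, the weight `lam`, and the function names are hypothetical, and the projection that turns the sparse code into the paper's features is not specified in this record, so it is not shown.

```python
import numpy as np

def soft_threshold(v, t):
    """Element-wise shrinkage: the proximal operator of t * ||.||_1."""
    return np.sign(v) * np.maximum(np.abs(v) - t, 0.0)

def sparse_code_ista(D, y, lam=0.1, n_iter=500):
    """Approximately solve min_x 0.5 * ||y - D x||_2^2 + lam * ||x||_1 with ISTA.

    D is an assumed dictionary (columns are atoms); y is one signal frame.
    """
    # Step size 1/L, where L = ||D||_2^2 bounds the curvature of the smooth term.
    L = np.linalg.norm(D, 2) ** 2
    x = np.zeros(D.shape[1])
    for _ in range(n_iter):
        grad = D.T @ (D @ x - y)                    # gradient of the quadratic term
        x = soft_threshold(x - grad / L, lam / L)   # gradient step, then shrinkage
    return x

# Synthetic example (assumed data, not from the paper): a 3-sparse code plus noise.
rng = np.random.default_rng(0)
D = rng.standard_normal((40, 100))
D /= np.linalg.norm(D, axis=0)                      # unit-norm atoms
x_true = np.zeros(100)
x_true[[5, 42, 77]] = [1.5, -2.0, 1.0]
y = D @ x_true + 0.01 * rng.standard_normal(40)

x_hat = sparse_code_ista(D, y, lam=0.05, n_iter=2000)
print("nonzero coefficients:", np.count_nonzero(x_hat))
```

In a classification pipeline, a vector such as `x_hat` would be computed per audio frame or segment and passed, possibly after a further projection, to the classifier. Larger values of `lam` give sparser codes at the cost of a looser fit to `y`.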


Bibliographic record

Source
Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), 2011, pp. 2412-2415 (4 pages)
Authors
Ziqiang Shi; Jiqing Han; Tieran Zheng
Original format
PDF
Chinese Library Classification
Communications
Keywords
Gaussian process classifiers; sparse representation; audio classification; L_1-minimization; speech discrimination


Similar literature


1. Gammatone Cepstral Coefficients: Biologically Inspired Features for Non-Speech Audio Classification [J]. Valero X., Alias F. IEEE Transactions on Multimedia. 2012, No. 6
2. Multistream sparse representation features for noise robust audio-visual speech recognition [J]. Peng Shen, Satoru Hayamizu, Satoshi Tamura. Acoustical Science and Technology. 2014, No. 1
3. Sparse Representation for Tumor Classification Based on Feature Extraction Using Latent Low-Rank Representation [J]. Bin Gan, Chun-Hou Zheng, Jun Zhang. BioMed Research International. 2014, No. 4
4. Real-World Speech/Non-Speech Audio Classification Based on Sparse Representation Features and GPCs [C]. Ziqiang Shi, Jiqing Han, Tieran Zheng. Annual Conference of the International Speech Communication Association. 2011
5. Sparse Representations and Feature Learning for Image Set Classification and Correspondence Estimation [D]. Fathy, Mohammed E. 2018
6. Sparse Representation for Tumor Classification Based on Feature Extraction Using Latent Low-Rank Representation [O]. Bin Gan, Chun-Hou Zheng, Jun Zhang
7. Multistream sparse representation features for noise robust audio-visual speech recognition [O]. Peng Shen, Satoshi Tamura, Satoru Hayamizu. 2014
8. Recognition of Three Distinctive Features in Brief-Duration Complex Non-Speech Sounds [R]. Silverman, E. B., Howard, J. H. 1977
