Robust speech recognition in noisy environments based on subband spectral centroid histograms

Gajic B.; Paliwal K.K.



Abstract

We investigate how dominant-frequency information can be used in speech feature extraction to increase the robustness of automatic speech recognition against additive background noise. First, we review several earlier proposed auditory-based feature extraction methods and argue that the use of dominant-frequency information might be one of the major reasons for their improved noise robustness. Furthermore, we propose a new feature extraction method, which combines subband power information with dominant subband frequency information in a simple and computationally efficient way. The proposed features are shown to be considerably more robust against additive background noise than standard mel-frequency cepstrum coefficients on two different recognition tasks. The performance improvement increased as we moved from a small-vocabulary isolated-word task to a medium-vocabulary continuous-speech task, where the proposed features also outperformed a computationally expensive auditory-based method. The greatest improvement was obtained for noise types characterized by a relatively flat spectral density.
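The idea of pairing subband power with a dominant subband frequency can be illustrated with a minimal sketch. This is not the authors' published algorithm: the equal-width subbands, the `log1p` power weighting, the histogram resolution, and all function names below are assumptions chosen only for illustration, and a naive DFT is used to keep the example dependency-free. Each subband's spectral centroid stands in for its dominant frequency, and the histogram accumulates a power-dependent weight at that frequency.

```python
import cmath
import math


def power_spectrum(frame):
    """Naive DFT power spectrum for bins 0..N/2 (adequate for short frames)."""
    n = len(frame)
    spectrum = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(frame))
        spectrum.append(abs(acc) ** 2)
    return spectrum


def subband_centroids(power, sample_rate, num_subbands):
    """Return a (centroid_hz, subband_power) pair for each subband.

    Assumption: subbands are contiguous and roughly equal in width.
    """
    num_bins = len(power)
    bin_hz = sample_rate / (2 * (num_bins - 1))
    edges = [round(i * num_bins / num_subbands)
             for i in range(num_subbands + 1)]
    results = []
    for lo, hi in zip(edges, edges[1:]):
        band = power[lo:hi]
        total = sum(band)
        if total > 0:
            # Power-weighted mean frequency of the subband.
            centroid = sum((lo + i) * bin_hz * p
                           for i, p in enumerate(band)) / total
        else:
            # Empty band: fall back to the band centre.
            centroid = (lo + hi - 1) / 2 * bin_hz
        results.append((centroid, total))
    return results


def centroid_histogram(centroids, sample_rate, num_hist_bins):
    """Histogram over frequency; each centroid adds log(1 + subband power).

    Assumption: the log1p weighting is an illustrative choice.
    """
    hist = [0.0] * num_hist_bins
    nyquist = sample_rate / 2
    for centroid, band_power in centroids:
        idx = min(int(centroid / nyquist * num_hist_bins), num_hist_bins - 1)
        hist[idx] += math.log1p(band_power)
    return hist
```

For a 64-sample frame of a pure 1250 Hz tone sampled at 8000 Hz, split into 4 subbands, the second subband's centroid falls at about 1250 Hz and almost all of the histogram weight lands in the bin that covers that frequency.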


Bibliographic details

Source
IEEE Transactions on Audio, Speech and Language Processing | 2006, No. 2 | pp. 600-608 | 9 pages
Authors
Gajic B.; Paliwal K.K.
Original format: PDF
Language: English
Chinese Library Classification: Automation technology, computer technology
Keywords
feature extraction; speech processing; speech recognition; auditory-based feature extraction methods; dominant-frequency information; medium-vocabulary continuous-speech task; robust speech recognition; small-vocabulary isolated-word task; speech feature extract;


Similar literature

1. Recognition of noisy speech using dynamic spectral subband centroids [J]. Jingdong Chen, Yiteng Huang, Qi Li. IEEE Signal Processing Letters, 2004, No. 2.
2. An effective cluster-based model for robust speech detection and speech recognition in noisy environments [J]. Gorriz JM, Ramirez J, Segura JC. The Journal of the Acoustical Society of America, 2006, No. 1.
3. Robust Parameters for Speech Recognition Based on Subband Spectral Centroid Histograms [C]. Bojana Gajic, Kuldip K. Paliwal. European Conference on Speech Communication and Technology, 2001.
4. Compressive nonlinearity for representing speech spectral magnitude to improve noise robustness of automatic speech recognition [D]. Wong, Brian. 2011.
5. Robust EEG-Based Decoding of Auditory Attention With High-RMS-Level Speech Segments in Noisy Conditions [O]. Lei Wang, Ed X. Wu, Fei Chen. 2020.
6. Robust Speech Recognition in Noisy Environments Based on Subband Spectral Centroid Histograms [O]. Paliwal Kuldip. 2006.
