ROBUST AUTOMATIC SPEECH RECOGNITION IN LOW-SNR CAR ENVIRONMENTS BY THE APPLICATION OF A CONNECTIONIST SUBSPACE-BASED APPROACH TO THE MEL-BASED CEPSTRAL COEFFICIENTS

Abstract

This paper addresses robust large-vocabulary continuous-speech recognition (CSR) in the presence of highly interfering car noise. Our approach reduces noise in the parameters used for recognition, namely the Mel-based cepstral coefficients. A Multilayer Perceptron (MLP) network performs the noise reduction in the cepstral domain in order to obtain less-variant parameters. The enhanced features are then refined via the Karhunen-Loeve Transform (KLT), implemented using Principal Component Analysis (PCA). Experiments, carried out throughout with the HTK Hidden Markov Model Toolkit, show that the enhanced parameters increase the recognition rate of the CSR process in highly interfering car noise environments. When included in the front-end of an HTK-based CSR system, the proposed hybrid technique outperforms conventional recognition with either KLT-based or MLP-based preprocessing alone in severely interfering car noise, over SNRs ranging from 16 dB to -4 dB, using a noisy version of the TIMIT database.
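The KLT refinement step mentioned in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names (`klt_fit`, `klt_project`, `klt_reconstruct`), the 12-coefficient feature size, the number of retained components, and the synthetic data standing in for MLP-enhanced cepstral vectors are all assumptions made for illustration.

```python
import numpy as np

def klt_fit(features, n_components):
    """Estimate a KLT (PCA) basis from a (frames x coefficients) matrix."""
    mean = features.mean(axis=0)
    cov = np.cov(features - mean, rowvar=False)
    # eigh returns eigenvalues in ascending order for a symmetric matrix
    eigvals, eigvecs = np.linalg.eigh(cov)
    order = np.argsort(eigvals)[::-1][:n_components]
    return mean, eigvecs[:, order]

def klt_project(features, mean, basis):
    """Project centered features onto the retained principal axes."""
    return (features - mean) @ basis

def klt_reconstruct(projected, mean, basis):
    """Map subspace coordinates back to the original feature space."""
    return projected @ basis.T + mean

# Synthetic stand-in data (assumption): a 4-dimensional latent signal
# embedded in 12 coefficients, plus additive white noise.
rng = np.random.default_rng(0)
latent = rng.normal(size=(500, 4))
mixing = rng.normal(size=(4, 12))
clean = latent @ mixing
noisy = clean + 0.3 * rng.normal(size=clean.shape)

mean, basis = klt_fit(noisy, n_components=4)
refined = klt_reconstruct(klt_project(noisy, mean, basis), mean, basis)

# Mean squared error against the clean signal, before and after refinement
err_before = np.mean((noisy - clean) ** 2)
err_after = np.mean((refined - clean) ** 2)
print(err_before, err_after)
```

Keeping only the leading components discards the directions dominated by noise, which is the general idea behind a subspace-based refinement. In a real front-end the basis would presumably be estimated on training features and the number of components tuned on held-out data.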

Bibliographic record

Source
European Conference on Speech Communication and Technology, vol. 3; September 3-7, 2001; Aalborg, DK | 2001 | pp. 1577-1580 | 4 pages
Conference location: Aalborg (DK)
Authors
Sid-Ahmed Selouani; Hesham Tolba; Douglas O'Shaughnessy
Author affiliation

INRS-Telecommunications, Universite du Quebec, 900 de la Gauchetiere Ouest, Quebec, H5A 1C6, Canada

Original format: PDF
Language: English
CLC classification: Communication theory
