首页> 外文会议>European conference on speech communication and technology >ROBUST AUTOMATIC SPEECH RECOGNITION IN LOW-SNR CAR ENVIRONMENTS BY THE APPLICATION OF A CONNECTIONIST SUBSPACE-BASED APPROACH TO THE MEL-BASED CEPSTRAL COEFFICIENTS

【24h】

ROBUST AUTOMATIC SPEECH RECOGNITION IN LOW-SNR CAR ENVIRONMENTS BY THE APPLICATION OF A CONNECTIONIST SUBSPACE-BASED APPROACH TO THE MEL-BASED CEPSTRAL COEFFICIENTS

机译：通过应用基于晶体基的思科系数的基于基于MEL的临床系数的鲁棒汽车环境中的强大自动语音识别

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, the problem of robust large-vocabulary continuous-speech recognition (CSR) in the presence of highly interfering car noise has been considered. Our approach is based on the noise reduction of the parameters that we use for recognition, that is, the Mel-based cepstral coefficients. This is achieved by the use of a Multilayer Perceptron (MLP) network for noise reduction in the cepstral domain in order to get less-variant parameters. Then, the obtained enhanced features are refined via the Karhunen-Loeve Transform (KLT) implemented using the Principal Component Analysis (PCA). Experiments show that the use of the enhanced parameters using such an approach increases the recognition rate of the CSR process in highly interfering car noise environments. The HTK Hidden Markov Model Toolkit was used throughout our experiments. Results show that the proposed hybrid technique when included in the front-end of an HTK-based CSR system, outperforms that of the conventional recognition process based on either a KLT- or an MLP-based preprocessing recognition in severe interfering car noise environments for a wide range of SNRs varying from 16 dB to -4 dB using a noisy version of the TIMIT database.

机译：在本文中，已经考虑了在存在高度干扰的汽车噪声存在下稳健的大词汇连续语音识别（CSR）的问题。我们的方法是基于我们用于识别的参数的降噪，即基于MEL的抗痉挛系数。这是通过使用多层感知者（MLP）网络来实现抗搏斯域中的降噪，以获得较少变体的参数。然后，通过使用主成分分析（PCA）实现的Karhunen-Loeve变换（KLT）来改进所获得的增强特征。实验表明，使用这种方法的增强参数的使用增加了CSR过程在高度干扰的汽车噪声环境中的识别率。在我们的实验中使用了HTK隐藏马尔可夫模型工具包。结果表明，所提出的混合技术当包含在基于HTK的CSR系统的前端时，胜过传统识别过程的基于KLT或基于MLP的预处理识别，从而在严重干扰汽车噪声环境中实现了传统识别过程使用Timit数据库的嘈杂版本，宽范围为16 dB至-4 dB。

著录项

来源
《European conference on speech communication and technology》|2001年||共4页
会议地点
作者
Sid-Ahmed Selouani; Hesham Tolba; Douglas O Shaughnessy;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类传播理论;
关键词

相似文献

外文文献
中文文献
专利

1. Enhanced Automatic Speech Recognition System Based on Enhancing Power-Normalized Cepstral Coefficients [J] . Mohamed Tamazin, Ahmed Gouda, Mohamed Khedr Applied Sciences . 2019,第10期

机译：基于增强功率归一化谱系齐系数的增强的自动语音识别系统
2. Exploiting independent filter bandwidth of human factor cepstral coefficients in automatic speech recognition [J] . Skowronski MD, Harris JG The Journal of the Acoustical Society of America . 2004,第3期

机译：在语音自动识别中利用人为因素倒谱系数的独立滤波器带宽
3. MEL FREQUENCY CEPSTRAL COEFFICIENTS (MFCC) FEATURE EXTRACTION ENHANCEMENT IN THE APPLICATION OF SPEECH RECOGNITION: A COMPARISON STUDY [J] . SAYF A. MAJEED, HAFIZAH HUSAIN, SALINA ABDUL SAMAD, Journal of Theoretical and Applied Information Technology . 2015,第1期

机译：MEL频率倒谱系数（MFCC）特征提取在语音识别中的应用：对比研究
4. ROBUST AUTOMATIC SPEECH RECOGNITION IN LOW-SNR CAR ENVIRONMENTS BY THE APPLICATION OF A CONNECTIONIST SUBSPACE-BASED APPROACH TO THE MEL-BASED CEPSTRAL COEFFICIENTS [C] . Sid-Ahmed Selouani, Hesham Tolba, Douglas O Shaughnessy European conference on speech communication and technology . 2001

机译：通过应用基于晶体基的思科系数的基于基于MEL的临床系数的鲁棒汽车环境中的强大自动语音识别
5. Estimation of cepstral coefficients for robust speech recognition. [D] . Indrebo, Kevin M. 2008

机译：倒频谱系数的估计，用于鲁棒的语音识别。
6. The application of fractional Mel cepstral coefficient in deceptive speech detection [O] . Xinyu Pan, Heming Zhao, Yan Zhou -1

机译：分数梅尔倒谱系数在欺骗性语音检测中的应用
7. Automatic Speech Recognition Based on Cepstral Coefficients and a Melbased Discrete Energy Operator [O] . Hesham Tolba 1998

机译：基于倒谱系数和基于mel的离散能量算子的自动语音识别

ROBUST AUTOMATIC SPEECH RECOGNITION IN LOW-SNR CAR ENVIRONMENTS BY THE APPLICATION OF A CONNECTIONIST SUBSPACE-BASED APPROACH TO THE MEL-BASED CEPSTRAL COEFFICIENTS

摘要

著录项

相似文献

相关主题

期刊订阅