首页> 外文学位 >Local feature extraction for robust speech recognition in the presence of noise.
【24h】

Local feature extraction for robust speech recognition in the presence of noise.

机译:局部特征提取可在存在噪声的情况下实现强大的语音识别。

获取原文
获取原文并翻译 | 示例

摘要

Feature extraction is one of the active research areas of speech recognition. Local features have received great attention among speech recognition researchers, since local features have advantages over the traditional speech features for reducing the effect of noise on the recognition performance. Here, our main focus is on extracting local features for speech recognition. Two new speech features are proposed for speech recognition. The first one is the lapped subband (LAP) features, extracted using the discrete orthogonal lapped transform. The second one is the Mel-Frequency Discrete Wavelet Coefficients (MFDWCs), extracted using the Discrete Wavelet Transform (DWT). The proposed features take advantage of being local in frequency domain which makes them robust to the noise.; Performance of the LAP features was evaluated on a phoneme recognition task and compared with the performance of subband (SUB) features and Mel-Frequency Cepstral Coefficients (MFCCs) features. Experimental results have shown that the proposed LAP features outperform SUB features and MFCCs features under white noise, band-limited white noise and no noise conditions. We evaluated the performance of MFDWCs for clean speech and noisy speech and compared the performance of MFDWCs with MFCCs, SUB features and multi-resolution (MULT) features. Experimental results on a phoneme recognition task showed that a MFDWC-based recognizer gave better results than recognizers based on MFCCs, SUB, and MULT features for the white Gaussian noise, band-limited white Gaussian noise and clean speech cases.; The Parallel Model Combination (PMC) technique is used along with MFDWCs features to take advantage of both noise compensation and local features (MFDWCs) to decrease the effect of noise on recognition performance. In addition a practical weighting technique based on the noise level of each coefficient is introduced. The experimental results show significant performance improvements for MFDWCs versus MFCCs after compensating the HMMs using the PMC technique. Weighting the partial score of each coefficient based on the noise level further improves the performance.; This work also investigated application of the DWT to the time-trajectory of the static coefficients to extract dynamic features.
机译:特征提取是语音识别的活跃研究领域之一。局部特征已经受到语音识别研究者的极大关注,因为局部特征相对于传统语音特征具有优势,可以减少噪声对识别性能的影响。在这里,我们的主要重点是提取语音识别的局部特征。提出了两种新的语音特征用于语音识别。第一个是重叠子带(LAP)特征,它是使用离散正交重叠变换提取的。第二个是使用离散小波变换(DWT)提取的梅尔频率离散小波系数(MFDWC)。所提出的特征利用了在频域中局部的优势,这使其对噪声具有鲁棒性。在音素识别任务上评估了LAP功能的性能,并将其与子带(SUB)功能和Mel频率倒谱系数(MFCC)功能的性能进行了比较。实验结果表明,在白噪声,带限白噪声和无噪声条件下,提出的LAP特征优于SUB特征和MFCC特征。我们评估了MFDWC在纯净语音和嘈杂语音方面的性能,并将MFDWC与MFCC,SUB功能和多分辨率(MULT)功能进行了比较。在音素识别任务上的实验结果表明,基于MFDWC的识别器比基于MFCC,SUB和MULT特征的识别器在白高斯噪声,带限白高斯噪声和干净语音情况下的识别效果更好。并行模型组合(PMC)技术与MFDWC的功能一起使用,以充分利用噪声补偿和局部特征(MFDWC)的优势,以减少噪声对识别性能的影响。另外,介绍了一种基于每个系数的噪声水平的实用加权技术。实验结果表明,在使用PMC技术补偿HMM之后,MFDWC与MFCC相比,性能有了显着提高。基于噪声水平对每个系数的部分得分进行加权可以进一步改善性能。这项工作还研究了DWT在静态系数的时间轨迹上的应用,以提取动态特征。

著录项

  • 作者

    Tufekci, Zekeriya.;

  • 作者单位

    Clemson University.;

  • 授予单位 Clemson University.;
  • 学科 Engineering Electronics and Electrical.
  • 学位 Ph.D.
  • 年度 2001
  • 页码 140 p.
  • 总页数 140
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类 无线电电子学、电信技术;
  • 关键词

  • 入库时间 2022-08-17 11:46:50

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号