首页> 外文期刊>電子情報通信学会技術研究報告. スマートインフォメディアシステム >New Speech Features Based on time-varying LPC for Robust Automatic Speech Recognition
【24h】

New Speech Features Based on time-varying LPC for Robust Automatic Speech Recognition

机译:基于时变LPC的新语音功能可实现鲁棒的自动语音识别

获取原文
获取原文并翻译 | 示例
           

摘要

Discrimination of similar pronunciation phrases is more difficult than that of normal phrases. In this paper, we propose the use of fast Fourier transform (FFT) based mel frequency cepstral coefficients (MFCCs) with time-varying linear predictive coding (TVLPC) based cepstrum. Due to the presence of intra-frame variations in TVLPC, it is expected that this approach will improve speech recognition. Evaluation results demonstrate that the proposed approach achieves between 46.67% and 60% and between 76.67% and 86.67% using dynamic range adjustment (DRA) as noise suppression technique at 10 dB and 20 dB SNR respectively. This is in comparison to 43.33% and 70% recognition accuracy achieved using conventional approach under similar conditions.
机译:区分相似发音短语比普通短语更困难。在本文中,我们建议使用基于快速傅里叶变换(FFT)的梅尔频率倒谱系数(MFCCs)和基于时变线性预测编码(TVLPC)的倒频谱。由于TVLPC中存在帧内变化,因此预计该方法将改善语音识别。评估结果表明,使用动态范围调整(DRA)作为噪声抑制技术,分别在SNR为10 dB和20 dB时,该方法可达到46.67%至60%以及76.67%至86.67%。相比之下,使用传统方法在相似条件下可达到43.33%和70%的识别精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号