首页> 中文期刊>计算机学报 >语音识别中带宽失配的补偿研究

语音识别中带宽失配的补偿研究

     

摘要

Speech recognition systems obtaining high recognition rates in clean environments per-form badly in mismatch environments without compensation. Based on the research, we found that bandwidth mismatch, namely the bandwidth difference between the training and test condi-tions, is one of the main factors leading to environment mismatch. When the bandwidth of the test speech is narrower than that of the training speech, the distortion is non-invertible and time-varying in the logarithm spectrum and cepstrum domains. So it could not be compensated with current channel compensation methods. After analyzing the Mel-frequency cepstrum coefficient distortion caused by the lost frequency band, we propose a compensation method based on spec-tral fold. Furthermore, we provide an algorithm for speech bandwidth detection and a unified compensation framework. Experiments on the AN4 and TIMIT/TIMIT databases show that the proposed framework improved the robustness of speech recognition underbandwidth mismatch conditions.%目前的语音识别系统在训练环境与测试环境匹配的情况下具有很高的识别率,而当环境失配时,其性能将急剧下降.作者研究发现,带宽失配,即训练语料和测试语料带宽不一致,也是引起环境失配的主要原因之一.当测试语音带宽比训练语音带宽窄时,丢失的频段不可逆,且其影响在倒谱域或对数频谱域七是时变的,因而无法用目前的信道补偿方法补偿.文章在分析丢失频段对梅尔频率倒谱系数影响的基础上,提出了用频谱折叠方法对窄带测试语音进行补偿.在此基础上给出了语音带宽检测算法和带宽补偿统一框架.在AN4和TIMIT/NTIMIT数据库上的实验表明,该框架能有效增强语音识别系统在带宽失配情况下的鲁棒性.

著录项

相似文献

  • 中文文献
  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号