首页> 外文期刊>Tsinghua Science and Technology >Nonlinear time-frequency distributions of spectrum energy operator in large vocabulary mandarin speaker independent speech recognition system
【24h】

Nonlinear time-frequency distributions of spectrum energy operator in large vocabulary mandarin speaker independent speech recognition system

机译:大型汉语普通话独立语音识别系统中频谱能量算子的非线性时频分布

获取原文
获取原文并翻译 | 示例
       

摘要

This work demonstrates the use of the nonlinear time-frequency distribution (NLTFD) of a discrete time energy operator (DTEO) based on amplitude demodulation modulation demodulation techniques as a feature in speech recognition. The duration distribution based hidden Markov module in a speaker independent large vocabulary mandarin speech recognition system was reconstructed from the feature vectors in the front-end detection stage. The goal was to improve the performance of the existing system by corrbining new features to the baseline ne feature vector. This paper also deals with errors associated with using a pre-emphasis filter in the front end processing of the present scheme, which causes an increase in the noise energy at high frequencies above 4 kHz and in some cases degrades the recognition accuracy. The experimental results show that eliminating the pre-emphasis filters from the pre-processing stage and using NLTFD with compensated DTEO combined with Mel frequency cepstrum components give a 21. 95% reduction in the relative error rate corrpared to the conventional technique with 25 candidates used in the test.
机译:这项工作演示了基于幅度解调调制解调技术作为语音识别功能的离散时间能量算子(DTEO)的非线性时频分布(NLTFD)的使用。在前端检测阶段,从特征向量重构了说话人无关的大词汇量普通话语音识别系统中基于持续时间分布的隐马尔可夫模块。目的是通过将新特征组合到基线和特征向量上来改善现有系统的性能。本文还处理了与在本方案的前端处理中使用预加重滤波器相关的误差,这会导致在4 kHz以上的高频下噪声能量增加,并且在某些情况下会降低识别精度。实验结果表明,从预处理阶段取消预加重滤波器,并使用带补偿DTEO的NLTFD和梅尔频率倒谱分量的组合,相对误差率降低了21. 95%,与传统技术相比,减少了25个候选值在测试中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号