【24h】

Two-Level Intercoupling HMM's For Speech Recognition

机译:两级互耦HMM用于语音识别

获取原文
获取原文并翻译 | 示例

摘要

A great number of experimental results in Chinese speech recognition show that the performance of a recognizer can be improved remarkably if the possible impact of the tone on speech recognition is brought into consideration. It is self-evident that the tone can be perceived independent of syllables in the utterances of different syllables with the same tone pattern, and likewise, in the utterances of the same syllable with different tone patterns the syllable can be perceived separately as well. Accordingly the speech features of the syllable should be able to be separated from that of the tone because of their independence in auditory perception and phonetics. Virtually it is extremely difficult to extract the corresponding speech features out of speech signals using the existed algorithms. It results in that the speech representations commonly used in Chinese speech recognition contain the speech features of both syllables and tones. Consequently, models of new structure are required for Chinese speech recognition in which speech features of the syllable and the tone in a syllable with some tone pattern should be modeled by two unique models. One for syllable, the other for tone, and these models should intercouple. In this paper on the basis of the fundamental framework of HMM's the authors depicted the relations between syllable HMM's and tone ones by introducing the intercoupling probabilities and originally presented a kind of two-level intercoupling HMM's. Baum-Welch algorithm was developed to reestimate their parameters. In order to reduce the amount of computation in speech recognition several approaches were suggested to simplify recognition schemes. The proposed approach increased recognition accuracy from 86.7% to 92.2% on the training set and from 82.9% to 87.3% on an independent set of test data.
机译:中文语音识别的大量实验结果表明,如果考虑到语气对语音识别的可能影响,识别器的性能将得到显着提高。不言而喻,在具有相同音调模式的不同音节的发声中,可以独立于音节来感知音调,并且同样地,在具有不同音调模式的相同音节的发声中,也可以分别地感知音节。因此,由于音节在听觉和语音上的独立性,它们的语音特征应该能够与音调分开。实际上,使用现有算法从语音信号中提取相应的语音特征极其困难。结果是,中国语音识别中常用的语音表示形式既包含音节又包含音调。因此,中文语音识别需要一种新的结构模型,在该模型中,应通过两个独特的模型来模拟音节和具有某些音调模式的音节中的音调的语音特征。一个用于音节,另一个用于音调,并且这些模型应该相互耦合。本文在HMM基础框架的基础上,通过介绍互耦概率,描述了音节HMM与音调之间的关系,并提出了一种两级互耦HMM。开发了Baum-Welch算法以重新估计其参数。为了减少语音识别中的计算量,提出了几种简化识别方案的方法。所提出的方法将训练集的识别准确率从86.7%提高到92.2%,将独立的测试数据集的识别准确率从82.9%提高到87.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号