Two-Level Intercoupling HMM's For Speech Recognition

机译：两级互耦HMM用于语音识别

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

A great number of experimental results in Chinese speech recognition show that the performance of a recognizer can be improved remarkably if the possible impact of the tone on speech recognition is brought into consideration. It is self-evident that the tone can be perceived independent of syllables in the utterances of different syllables with the same tone pattern, and likewise, in the utterances of the same syllable with different tone patterns the syllable can be perceived separately as well. Accordingly the speech features of the syllable should be able to be separated from that of the tone because of their independence in auditory perception and phonetics. Virtually it is extremely difficult to extract the corresponding speech features out of speech signals using the existed algorithms. It results in that the speech representations commonly used in Chinese speech recognition contain the speech features of both syllables and tones. Consequently, models of new structure are required for Chinese speech recognition in which speech features of the syllable and the tone in a syllable with some tone pattern should be modeled by two unique models. One for syllable, the other for tone, and these models should intercouple. In this paper on the basis of the fundamental framework of HMM's the authors depicted the relations between syllable HMM's and tone ones by introducing the intercoupling probabilities and originally presented a kind of two-level intercoupling HMM's. Baum-Welch algorithm was developed to reestimate their parameters. In order to reduce the amount of computation in speech recognition several approaches were suggested to simplify recognition schemes. The proposed approach increased recognition accuracy from 86.7% to 92.2% on the training set and from 82.9% to 87.3% on an independent set of test data.

机译：中文语音识别的大量实验结果表明，如果考虑到语气对语音识别的可能影响，识别器的性能将得到显着提高。不言而喻，在具有相同音调模式的不同音节的发声中，可以独立于音节来感知音调，并且同样地，在具有不同音调模式的相同音节的发声中，也可以分别地感知音节。因此，由于音节在听觉和语音上的独立性，它们的语音特征应该能够与音调分开。实际上，使用现有算法从语音信号中提取相应的语音特征极其困难。结果是，中国语音识别中常用的语音表示形式既包含音节又包含音调。因此，中文语音识别需要一种新的结构模型，在该模型中，应通过两个独特的模型来模拟音节和具有某些音调模式的音节中的音调的语音特征。一个用于音节，另一个用于音调，并且这些模型应该相互耦合。本文在HMM基础框架的基础上，通过介绍互耦概率，描述了音节HMM与音调之间的关系，并提出了一种两级互耦HMM。开发了Baum-Welch算法以重新估计其参数。为了减少语音识别中的计算量，提出了几种简化识别方案的方法。所提出的方法将训练集的识别准确率从86.7％提高到92.2％，将独立的测试数据集的识别准确率从82.9％提高到87.3％。

著录项

来源
《Proceeding of the First international conference on multimodal interfaces(ICMI'96)》|1996年|123-126|共4页
会议地点 Beijing(CN);Beijing(CN)
作者
Gongjun Li; Taiyi Huang;
展开▼
作者单位

National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences Beijing 100080, P. R. China;

National Laboratory of Pattern Recognition Institute of Automation, Chinese Academy of Sciences Beijing 100080, P. R. China;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类计算技术、计算机技术;
关键词

相似文献

外文文献
中文文献
专利

1. Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System [J] . Dhanalakshmi M., Celin T. A. Mariya, Nagarajan T., Circuits, systems, and signal processing . 2018,第2期

机译：基于HMM的语音识别和自适应合成系统的韵律演讲者的语音输入语音输出通信
2. Arabic Speech Recognition System Based on MFCC and HMMs [J] . Hussien A. Elharati, Mohamed Alshaari, Veton Z. K?puska Journal of Computer and Communications . 2020,第3期

机译：基于MFCC和HMMS的阿拉伯语语音识别系统
3. Arabic Speech Recognition System Based on MFCC and HMMs [J] . Hussien A. Elharati, Mohamed Alshaari, Veton Z. K?puska 电脑和通信（英文） . 2020,第003期

机译：基于MFCC和HMM的阿拉伯语语音识别系统
4. Two-Level Intercoupling HMM's For Speech Recognition [C] . International conference on multimodal interfaces . 1996

机译：两级互动嗯，用于语音识别
5. Classification and recognition of speech under perceptual stress using neural networks and N-D HMMs. [D] . Womack, Brian David. 1996

机译：使用神经网络和N-D HMM在感知压力下对语音进行分类和识别。
6. Estimation of Phoneme-Specific HMM Topologies for the Automatic Recognition of Dysarthric Speech [O] . Santiago-Omar Caballero-Morales 2013

机译：语音异常自动识别的音素特定HMM拓扑估计
7. Fused HMM-adaptation of multi-stream HMMs for audio-visual speech recognition [O] . Dean David B., Lucey Patrick J., Sridharan Sridha, 2007

机译：多流HMM的融合HMM自适应用于视听语音识别

Two-Level Intercoupling HMM's For Speech Recognition

摘要

著录项

相似文献

相关主题

期刊订阅