首页> 外文会议> >Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data

【24h】

Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data

机译：词汇量很大，但培训数据有限，可以完全识别连续的汉语普通话语音

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper presents the first known results for complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but very limited training data. Although some isolated-syllable-based or isolated-word-based large-vocabulary Mandarin speech recognition systems have been successfully developed, a continuous-speech-based system of this kind has never been reported before. For successful development of this system, several important techniques have been used, including acoustic modeling of a set of sub-syllabic models for base syllable recognition and another set of context-dependent models for tone recognition, a multiple candidate searching technique based on a concatenated syllable matching algorithm to synchronize base syllable and tone recognition, and a word-class-based Chinese language model for linguistic decoding. The best recognition accuracy achieved is 88.69% for finally decoded Chinese characters, with 88.69%, 91.57%, and 81.37% accuracy for base syllables, tones, and tonal syllables respectively.

机译：本文提出了第一个已知的结果，它可以完全识别具有很大词汇量但非常有限的培训数据的汉语连续汉语普通话。尽管已经成功地开发了一些基于孤立音节或基于单词的大词汇量普通话语音识别系统，但是从未有过这种基于连续语音的系统的报道。为了成功开发该系统，已使用了几种重要的技术，包括用于基本音节识别的一组亚音节模型的声学模型和用于音调识别的另一组与上下文相关的模型，一种基于级联的多候选搜索技术。音节匹配算法，用于同步基本音节和音调识别;以及基于单词类的中文语言模型，用于语言解码。最终解码的汉字的最佳识别准确度为88.69％，基本音节，音调和音调音节的准确度分别为88.69％，91.57％和81.37％。

著录项

来源
《》|1995年|P.61-64|共4页
会议地点
作者
Hsin-Min Wang; Jia-Lin Shen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary using limited training data [J] . Hsin-Min Wang, Tai-Hsuan Ho IEEE Transactions on Speech and Audio Proceeding . 1997,第2期

机译：使用有限的训练数据就可以完全识别具有很大词汇量的连续汉语普通话语音
2. Complete recognition of continuous Mandarin speech for Chineselanguage with very large vocabulary using limited training data [J] . Hsin-Min Wang, Tai-Hsuan Ho, Rung-Chiung Yang, IEEE Transactions on Speech and Audio Proceessing . 1997,第2期

机译：使用有限的培训数据，可以完全识别具有很大词汇量的连续汉语普通话语音
3. Continuous Mandarin speech recognition for Chinese language with large vocabulary based on segmental probability model [J] . Shen J.-L. IEE Proceedings. Part K . 1998,第5期

机译：基于分段概率模型的大词汇量汉语连续汉语语音识别
4. Fast and accurate recognition of very-large-vocabulary continuous Mandarin speech for Chinese language with improved segmental probability modeling [C] . Jia-Lin Shen, Lin-Shan Lee . 1996

机译：改进的分段概率模型可快速准确地识别超大型词汇的汉语连续汉语语音
5. Modeling lexical tones for Mandarin large vocabulary continuous speech recognition. [D] . Lei, Xin. 2006

机译：为普通话大词汇量连续语音识别建模词汇声调。
6. Using Morphological Data in Language Modeling for Serbian Large Vocabulary Speech Recognition [O] . Edvin Pakoci, Branislav Popović, Darko Pekar 2019

机译：在塞尔维亚大型词汇语音识别的语言建模中使用形态学数据
7. Complete Recognition of Continuous Mandarin Speech for Chinese Language with Very Large Vocabulary Using Limited Training Data [O] . Hsin-min Wang, Tai-hsuan Ho, Rung-chiung Yang, 1997

机译：利用有限的训练数据完全识别具有超大词汇量的汉语连续普通话
8. Use of Computer Speech Understanding in Training: A Preliminary Investigation of a Limited Continuous Speech Recognition Capability. [R] . Porter, J. E., Grady, M. W., Hicklin, M. B., 1977

机译：计算机语音理解在训练中的运用：有限连续语音识别能力的初步研究。

Complete recognition of continuous Mandarin speech for Chinese language with very large vocabulary but limited training data

摘要

著录项

相似文献

相关主题

期刊订阅