While early systems adopted isolated syllables as input units and required tedious enrollment, our research focuses on speaker-independent, word-based dictation. A deliberately designed 120-speaker database was built for training; inter-syllable, tonal, and endpoint-dependent acoustic models are applied with promising MFCC features. A two-pass acoustic matching scheme accelerates recognition, taking full advantage of the monosyllabic structure of Chinese speech. Complete word bigram and trigram models serve as the language processing module. With all these efforts combined, the system reaches 90% character accuracy, running in almost real time on a Pentium PC without DSP assistance.
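To make the language processing module concrete, the following is a minimal sketch of the kind of word bigram/trigram model the abstract describes, combined here by simple linear interpolation. The corpus, interpolation weights, and class name are illustrative assumptions, not the paper's actual configuration.

```python
import math
from collections import defaultdict

class InterpolatedLM:
    """Hypothetical bigram/trigram language model with linear interpolation."""

    def __init__(self, corpus, l1=0.4, l2=0.3, l3=0.3):
        # l1..l3 are assumed interpolation weights for uni-, bi-, trigrams.
        self.uni = defaultdict(int)
        self.bi = defaultdict(int)
        self.tri = defaultdict(int)
        self.total = 0
        for sent in corpus:
            toks = ["<s>", "<s>"] + sent + ["</s>"]
            for i in range(2, len(toks)):
                self.uni[toks[i]] += 1
                self.bi[(toks[i - 1], toks[i])] += 1
                self.tri[(toks[i - 2], toks[i - 1], toks[i])] += 1
                self.total += 1
        self.l1, self.l2, self.l3 = l1, l2, l3

    def prob(self, w2, w1, w):
        # Interpolate unigram, bigram, and trigram relative frequencies.
        p1 = self.uni[w] / self.total if self.total else 0.0
        p2 = self.bi[(w1, w)] / self.uni[w1] if self.uni[w1] else 0.0
        h = self.bi[(w2, w1)]  # trigram history count
        p3 = self.tri[(w2, w1, w)] / h if h else 0.0
        return self.l1 * p1 + self.l2 * p2 + self.l3 * p3

    def logprob(self, sent):
        # Log-probability of a full word sequence, padded with boundary marks.
        toks = ["<s>", "<s>"] + sent + ["</s>"]
        return sum(math.log(self.prob(toks[i - 2], toks[i - 1], toks[i]))
                   for i in range(2, len(toks)))
```

In a dictation system such a score would be combined with the two-pass acoustic match scores to rank candidate word sequences.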