首页> 外文会议> >Speaker-independent dictation of Chinese speech with 32K vocabulary
【24h】

Speaker-independent dictation of Chinese speech with 32K vocabulary

机译:32K词汇量的独立于说话者的听写

获取原文

摘要

While early machines adopted isolated syllables as input units and needed boring enrollment, our research focus on the speaker independent, word based dictation. A deliberately designed 120 speaker database was built for training; inter syllable context, tonal and endpoint dependent acoustic model are applied with a promising MFCC feature. Two pass acoustic matching accelerates the recognition, taking full advantage of the monosyllabic structure of Chinese speech. A complete word bigram and trigram serve as language processing module. With all efforts, the system reaches 90% character accuracy, performing in almost real time on a Pentium PC without DSP help.
机译:早期的机器采用孤立的音节作为输入单位并需要无聊的注册,而我们的研究则集中在与说话者无关的基于单词的听写上。专门设计的120个演讲者数据库已建立用于培训;在音节之间的上下文中,音调和端点相关的声学模型与有希望的MFCC功能一起应用。两遍声学匹配充分利用了中文语音的单音节结构,加快了识别速度。一个完整的单词bigram和trigram充当语言处理模块。尽一切努力,该系统达到了90%的字符精度,几乎可以在没有DSP帮助的情况下在奔腾PC上实时执行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号