首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >MAIN VOWEL DOMAIN TONE MODELING WITH LEXICAL AND PROSODIC ANALYSIS FOR MANDARIN ASR
【24h】

MAIN VOWEL DOMAIN TONE MODELING WITH LEXICAL AND PROSODIC ANALYSIS FOR MANDARIN ASR

机译:普通话与普通话分析的主要元音域音色建模

获取原文

摘要

The tone is a distinctive discriminative feature in Mandarin Chinese. Often functional, yet seldom thorough are most large-scale Mandarin speech recognition systems in treating tone modeling. In particular, many lack the necessary sophistication to deal with the myriad variations arising from the combination of acoustic and lexical contexts. This paper reports an attempt to account for these variabilities and to bring richer tone modeling into the IBM Mandarin broadcast transcription system. In particular, we describe a system that combines the embedded approach and a novel explicit tone modeling technique characterized by a. robust tone tracking in the main-vowel domain, and b. context-dependent models with lexical and prosodic contexts. The proposed method is validated on a connected-digits set and subsequently evaluated on a large-vocabulary broadcast transcription task. It is shown that 14.8percent and 5.4percent relative reductions in character error rate are achieved respectively.
机译:语气是普通话中的一个独特的歧视特征。经常练功,很少彻底是大量大规模的普通话语音识别系统,治疗语气建模。特别是,许多人缺乏对来自声学和词汇表的组合产生的无数变化的必要复杂性。本文报告了试图考虑这些可变性,并将更丰富的口气建模纳入IBM普通话广播转录系统。特别地,我们描述了一种组合嵌入方法的系统和一种新颖的显式音调建模技术,其特征是a。主元音域中的强大音调跟踪,b。与词汇和韵律上下文的上下文相关模型。所提出的方法在连接的数字集上验证,随后在大词汇表广播转录任务上进行评估。结果表明,分别实现了14.8%和5.4分别以字符错误率的相对减少。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号