首页> 外文会议>IEEE International Conference on Acoustics, Speech, and Signal Processing >TRIPHONE MODEL RECONSTRUCTION FOR MANDARIN PRONUNCIATION VARIATIONS
【24h】

TRIPHONE MODEL RECONSTRUCTION FOR MANDARIN PRONUNCIATION VARIATIONS

机译:Triphone模型重建普通话发音变化

获取原文

摘要

The high error rate of recognition accuracy in spontaneous speech is due in part to the poor modeling of pronunciations. In this paper, we propose modeling pronunciation variations through triphone model reconstruction. We first generate partial change phone model (PCPM) to differentiate pronunciation variations. In order to improve the resolution of triphone models, PCPM is used as a hidden model and merged into the pre-trained acoustic model through model reconstruction. To avoid model confusion, auxiliary decision trees are established for triphone PCPMs. The acoustic model reconstruction on triphones is equivalent to decision tree merging. The effectiveness of this approach is evaluated on the 1997 Hub4NE Mandarin Broadcast News Corpus (1997 MBN) with different styles of speech. It gives a significant 2.39% absolute syllable error rate reduction in spontaneous speech.
机译:自发语音中的识别准确度的高误差率部分是由于发音不良建模。在本文中,我们建议通过Trighone模型重建建模的发音变化。我们首先生成部分更改电话型号(PCPM)以区分发音变化。为了改善Triphone模型的分辨率,PCPM用作隐藏模型,并通过模型重建合并到预训练的声学模型中。为避免模型混淆,为Trighone PCPMS建立辅助决策树。 Triphones上的声学模型重建相当于决策树合并。这种方法的有效性在1997年的Hub4ne普通话广播新闻语料库(1997 MBN)中评估了不同风格的语音。它在自发语音中提出了显着的2.39%的绝对音节错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号