首页> 外文会议>Conference of the International Speech Communication Association >Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data
【24h】

Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data

机译:轻轻监督的石墨模型训练,用于改进语音和文本数据的句子级别对齐

获取原文

摘要

This paper introduces a method for lightly supervised discriminative training using MMI to improve the alignment of speech and text data for use in training HMM-based TTS systems for low-resource languages. In TTS applications, due to the use of long-span contexts, it is important to select training utterances which have wholly correct transcriptions. In a low-resource setting, when using poorly trained grapheme models, we show that the use of MMI discriminative training at the grapheme-level enables us to increase the amount of correctly aligned data by 40%, while maintaining a 7% sentence error rate and 0.8% word error rate. We present the procedure for lightly supervised discriminative training with regard to the objective of minimising sentence error rate.
机译:本文介绍了一种使用MMI轻轻监督鉴别训练的方法,以改善语音和文本数据的对准,以便用于培训基于HMM的TTS系统的低资源语言。在TTS应用中,由于使用长跨度上下文,选择具有完全正确转录的培训话语非常重要。在低资源设置中,当使用训练有素的图形模型时,我们表明,在图形级别使用MMI鉴别培训使我们能够将正确对齐的数据量增加40%,同时保持7%的句子错误率和0.8%的字错误率。我们介绍了在最小化句子错误率的目标方面轻微监督歧视性培训的程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号