Lightly Supervised Discriminative Training of Grapheme Models for Improved Sentence-level Alignment of Speech and Text Data

Abstract

This paper introduces a method for lightly supervised discriminative training using MMI to improve the alignment of speech and text data for use in training HMM-based TTS systems for low-resource languages. In TTS applications, due to the use of long-span contexts, it is important to select training utterances which have wholly correct transcriptions. In a low-resource setting, when using poorly trained grapheme models, we show that the use of MMI discriminative training at the grapheme-level enables us to increase the amount of correctly aligned data by 40%, while maintaining a 7% sentence error rate and 0.8% word error rate. We present the procedure for lightly supervised discriminative training with regard to the objective of minimising sentence error rate.
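
The abstract reports both a sentence error rate and a word error rate for the aligned data. The sketch below illustrates how the two metrics differ, since a single wrong word makes a whole sentence count as an error. It is a minimal illustration under stated assumptions, not the authors' evaluation code: the function names `word_edit_distance` and `error_rates` and the toy sentence pairs are hypothetical, and word errors are counted with a plain Levenshtein distance over whitespace-separated words.

```python
from typing import List, Sequence, Tuple


def word_edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance between two word sequences (each edit costs 1)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion of a reference word
                            curr[j - 1] + 1,      # insertion of a hypothesis word
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def error_rates(pairs: List[Tuple[str, str]]) -> Tuple[float, float]:
    """Return (sentence error rate, word error rate) for (reference, hypothesis) pairs."""
    wrong_sentences = 0
    word_errors = 0
    ref_words = 0
    for ref, hyp in pairs:
        r, h = ref.split(), hyp.split()
        d = word_edit_distance(r, h)
        word_errors += d
        ref_words += len(r)
        wrong_sentences += 1 if d > 0 else 0  # any word error spoils the sentence
    return wrong_sentences / len(pairs), word_errors / ref_words


# Toy data (hypothetical): one substituted word in four sentences.
pairs = [
    ("the cat sat on the mat", "the cat sat on the mat"),
    ("a quick brown fox", "a quick brown box"),
    ("speech and text data", "speech and text data"),
    ("low resource languages", "low resource languages"),
]
ser, wer = error_rates(pairs)
print(f"SER = {ser:.3f}, WER = {wer:.3f}")
```

On this toy set, one wrong word out of 17 puts a quarter of the sentences in error. This is why a sentence-level criterion is the stricter one when whole utterances must be correctly transcribed.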


Bibliographic record

Source: Conference of the International Speech Communication Association, 2013, 5 pages
Authors: Adriana Stan; Peter Bell; Junichi Yamagishi; Simon King
Original format: PDF
Chinese Library Classification: TN912.3-532
Keywords: automatic alignment; grapheme models; light supervision; MMI; text-to-speech
