首页> 外文期刊>Research journal of applied science, engineering and technology >Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay
【24h】

Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay

机译:将轻型监督的学习和用户反馈结合起来构建和改进马来语的统计参数语音合成器

获取原文
           

摘要

In this study, we aim to reduce the human effort in preparing training data for synthesizing human speech and improve the quality of synthetic speech. In spite of the learning-from-data used to train the statistical models, the construction of a statistical parametric speech synthesizer involves substantial human effort, especially when using imperfect data or working on a new language. Here, we use lightly-supervised methods for preparing the data and constructing the text-processing front end. This initial system is then iteratively improved using active learning in which feedback from users is used to disambiguate the pronunciation system in our chosen language, Malay. The data are prepared using speaker diarisation and lightly-supervised text-speech alignment. In the front end, grapheme-based units are used. The active learning used small amounts of feedback from a listener to train a classifier. We report evaluations of two systems built from high-quality studio data and lower-quality `found' data respectively and show that the intelligibility of each can be improved using active learning.
机译:在这项研究中,我们旨在减少人类努力,为综合人类言论进行培训,提高合成言论的质量。尽管用于训练统计模型的学习 - 从数据,但统计参数合成器的构建涉及大量的人力努力,特别是在使用不完美的数据或在新语言上工作时。在这里,我们使用轻型监督方法来准备数据并构建文本处理前端。然后使用主动学习迭代地改善了该初始系统,其中用户使用来自用户的反馈来消除我们所选语言的发音系统,马来语。数据使用扬声器估计和轻微监督的文本语音对齐来制备。在前端,使用基于格子的单元。主动学习使用少量听众的反馈来训练分类器。我们报告了两种从高质量的工作室数据和较低质量的“找到”数据构建的两个系统的评估,并显示使用主动学习可以改善每个系统的可理解性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号