Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay

Lau Chee Yong; Oliver Watts; Simon King

首页> 外文期刊>Research journal of applied science, engineering and technology >Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay

【24h】

Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay

机译：将轻型监督的学习和用户反馈结合起来构建和改进马来语的统计参数语音合成器

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, we aim to reduce the human effort in preparing training data for synthesizing human speech and improve the quality of synthetic speech. In spite of the learning-from-data used to train the statistical models, the construction of a statistical parametric speech synthesizer involves substantial human effort, especially when using imperfect data or working on a new language. Here, we use lightly-supervised methods for preparing the data and constructing the text-processing front end. This initial system is then iteratively improved using active learning in which feedback from users is used to disambiguate the pronunciation system in our chosen language, Malay. The data are prepared using speaker diarisation and lightly-supervised text-speech alignment. In the front end, grapheme-based units are used. The active learning used small amounts of feedback from a listener to train a classifier. We report evaluations of two systems built from high-quality studio data and lower-quality `found' data respectively and show that the intelligibility of each can be improved using active learning.

机译：在这项研究中，我们旨在减少人类努力，为综合人类言论进行培训，提高合成言论的质量。尽管用于训练统计模型的学习 - 从数据，但统计参数合成器的构建涉及大量的人力努力，特别是在使用不完美的数据或在新语言上工作时。在这里，我们使用轻型监督方法来准备数据并构建文本处理前端。然后使用主动学习迭代地改善了该初始系统，其中用户使用来自用户的反馈来消除我们所选语言的发音系统，马来语。数据使用扬声器估计和轻微监督的文本语音对齐来制备。在前端，使用基于格子的单元。主动学习使用少量听众的反馈来训练分类器。我们报告了两种从高质量的工作室数据和较低质量的“找到”数据构建的两个系统的评估，并显示使用主动学习可以改善每个系统的可理解性。

著录项

来源
《Research journal of applied science, engineering and technology》 |2015年第11期|共6页
作者
Lau Chee Yong; Oliver Watts; Simon King;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Combining Lightly-supervised Learning and User Feedback to Construct Andimprove a Statistical Parametric Speech Synthesizer for Malay [J] . Lau Chee Yong, Oliver Watts, Simon King Research journal of applied science, engineering and technology . 2015,第11期

机译：结合轻度监督学习和用户反馈来构建和改进马来语统计参数语音合成器
2. LOW FOOTPRINT HIGH INTELLIGIBILITY MALAY SPEECH SYNTHESIZER BASED ON STATISTICAL DATA [J] . Lau Chee Yong, Tan Tian Swee Journal of computer sciences . 2014,第2期

机译：基于统计数据的低足迹高智能马来语语音合成器
3. LOW FOOTPRINT HIGH INTELLIGIBILITY MALAY SPEECH SYNTHESIZER BASED ON STATISTICAL DATA | Science Publications [J] . Lau Chee Yong, Tan Tian Swee Journal of computer sciences . 2013,第2期

机译：基于统计数据的低足迹高智能马来语语音合成器|科学出版物
4. Fast, Compact, and High Quality LSTM-RNN Based Statistical Parametric Speech Synthesizers for Mobile Devices [C] . Heiga Zen, Yannis Agiomyrgiannakis, Niels Egberts, Annual Conference of the International Speech Communication Association . 2016

机译：基于快速，紧凑，高品质的LSTM-RNN用于移动设备的基于统计参数语音合成器
5. Statistical Parametric Speech Synthesis using Deep Learning Architectures [D] . Kang, Shiyin. 2016

机译：使用深度学习架构的统计参数致辞
6. Discriminative Multi-Stream Postfilters Based on Deep Learning for Enhancing Statistical Parametric Speech Synthesis [O] . Marvin Coto-Jiménez 2021

机译：基于深度学习的判别多流破旧用于增强统计参数致辞综合
7. LOW FOOTPRINT HIGH INTELLIGIBILITY MALAY SPEECH SYNTHESIZER BASED ON STATISTICAL DATA [O] . Lau Chee Yong, Tan Tian Swee 2015

机译：基于统计数据的低容量高智能马氏语音合成器

Combining Lightly-supervised Learning and User Feedback to Construct and Improve a Statistical Parametric Speech Synthesizer for Malay

摘要

著录项

相似文献

相关主题

期刊订阅