Objective measures to improve the selection of training speakers in HMM-based child speech synthesis

机译：改进汉姆培育综合培训人员培训师选择的客观措施

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Building synthetic child voices is considered a difficult task due to the challenges associated with data collection. As a result, speaker adaptation in conjunction with Hidden Markov Model (HMM)-based synthesis has become prevalent in this domain because the approach caters for limited amounts of data. An initial average voice model is trained using data from multiple speakers and adapted to resemble a specific target child speaker. Due to the scarcity of child speech data, initial models used in this approach are mostly trained with adult speech data. However, selection of appropriate training speakers from large corpora is not a trivial task because there is no means, other than conducting exhaustive subjective listening tests, to determine which training speakers will yield the best quality synthetic child voice. Therefore, there is a need to find an objective measure that can be used to easily identify a small set of training speakers that will yield the best quality output. In this paper we investigate whether a relationship exists between objective and subjective voice evaluation measures with regard to the selection of training speakers for an average voice model used in speaker-adaptive HMM child speech synthesis. Results indicate that, if training speakers that are closer to the target speaker are used to train initial models, better quality child voices are generated.

机译：由于与数据收集相关的挑战，构建合成儿童声音被认为是一项艰巨的任务。结果，与隐马尔可夫模型（HMM）的合成结合的扬声器适应在该领域中普遍存在域中，因为该方法能够满足有限的数据量。使用来自多个扬声器的数据训练初始平均语音模型，并适用于类似于特定的目标儿童扬声器。由于儿童语音数据的稀缺性，这种方法中使用的初始模型主要由成人语音数据培训。然而，从大型语料库中选择适当的培训扬声器不是一个琐碎的任务，因为除了进行详尽的主观听力测试之外，没有任何方法，确定哪些培训扬声器将产生最优质的合成子声音。因此，需要找到一个客观措施，可以用于轻松识别一小组训练扬声器，这将产生最佳质量输出。在本文中，我们调查了对客观和主观语音评估措施之间的关系是否存在关于扬声器 - 自适应HMM儿童语音合成的平均语音模型的训练扬声器之间的目标和主观语音评估措施。结果表明，如果越来越靠近目标扬声器的培训扬声器用于培训初始模型，则会生成更好的素质儿童声音。

著录项

来源
《International Conference on Pattern Recognition Association of South Africa and Robotics and Mechatronics》|2016年|1 v.|共6页
会议地点
作者
Avashna Govender; Febe de Wet;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TH15;
关键词
Hidden Markov models; Speech; Adaptation models; Speech synthesis; Training; Data models; Databases;

机译：隐马尔可夫模型;语音;适应模型;语音合成;培训;数据模型;数据库;

相似文献

外文文献
中文文献
专利

1. Speech-Input Speech-Output Communication for Dysarthric Speakers Using HMM-Based Speech Recognition and Adaptive Synthesis System [J] . Dhanalakshmi M., Celin T. A. Mariya, Nagarajan T., Circuits, systems, and signal processing . 2018,第2期

机译：基于HMM的语音识别和自适应合成系统的韵律演讲者的语音输入语音输出通信
2. Personalising speech-to-speech translation: Unsupervised cross-lingual speaker adaptation for HMM-based speech synthesis [J] . John Dines, Hui Liang, Lakshmi Saheer, Computer speech and language . 2013,第2期

机译：个性化语音到语音翻译：基于HMM的语音合成的无监督跨语言说话者自适应
3. Prosody Correction Preserving Speaker Individuality for Chinese-Accented Japanese HMM-Based Text-to-Speech Synthesis [J] . Daiki SEKIZAWA, Shinnosuke TAKAMICHI, Hiroshi SARUWATARI IEICE transactions on information and systems . 2019,第6期

机译：基于汉字的日本HMM语音合成中保留韵律校正的说话人个性
4. Objective measures to improve the selection of training speakers in HMM-based child speech synthesis [C] . Avashna Govender, Febe de Wet International Conference on Pattern Recognition Association of South Africa and Robotics and Mechatronics . 2016

机译：在基于HMM的儿童语音合成中改进训练说话者选择的客观措施
5. HMM-based non-intrusive speech quality and implementation of Viterbi score distribution and hiddenness based measures to improve the performance of speech recognition [D] . Talwar, Gaurav 2006

机译：基于HMM的非侵入式语音质量以及基于Viterbi分数分布和隐蔽性的措施的实施，以提高语音识别的性能
6. Using Patient Reported Outcome Measures to Improve Service Effectiveness (UPROMISE): Training clinicians to Use Outcome Measures in Child Mental Health [O] . Julian Edbrooke-Childs, Miranda Wolpert, Jessica Deighton -1

机译：使用患者报告的结果措施来提高服务效率（UPROMISE）：培训临床医生在儿童心理健康中使用结果措施
7. HMM-Based Distributed Text-to-Speech Synthesis Incorporating Speaker-Adaptive Training [O] . Kwang Myung Jeon, Seung Ho Choi 2015

机译：基于Hmm的分布式文本到语音合成结合扬声器自适应训练

Objective measures to improve the selection of training speakers in HMM-based child speech synthesis

摘要

著录项

相似文献

相关主题

期刊订阅