首页> 外文会议>2017 Pattern Recognition Association of South Africa and Robotics and Mechatronics >Building synthetic voices for under-resourced languages: The feasibility of using audiobook data
【24h】

Building synthetic voices for under-resourced languages: The feasibility of using audiobook data

机译:为资源不足的语言构建合成语音:使用有声读物数据的可行性

获取原文
获取原文并翻译 | 示例

摘要

Creating synthetic voices that are natural and intelligible is a daunting challenge for well-resourced languages. The challenge is much bigger for languages in which the speech and text resources required for voice development are not available. Previous studies have suggested audiobooks as an alternative source of speech data. This paper reports on a comparison between voices derived from audiobook data and voices based on professional voice artist data. Two sets of voices were evaluated: male voices built using very small amounts of both data types (around 3 hours, representing a severely resource constrained scenario) and female voices trained on almost 10 hours of audiobook and professional speech data. The results of subjective listening tests indicate that, while the majority of the listeners preferred the voice artists' voices over the audiobook voices, the difference in naturalness was not perceived to be substantial. Results also showed that the artists' voices outperform the audiobook voices in terms of intelligibility, especially if a limited amount of training data is available. Although additional training data improves the intelligibility of audiobook voices, the results seem to indicate that a smaller quantity of professional data yields a better voice than large volumes of especially old audiobook data.
机译:对于资源丰富的语言而言,创建自然且可理解的合成声音是一项艰巨的挑战。对于没有语音开发所需的语音和文本资源的语言,挑战更大。先前的研究建议有声读物作为语音数据的替代来源。本文报告了有声书数据和基于专业语音艺术家数据的语音之间的比较。评估了两组声音:使用非常少量的两种数据类型(大约3小时,表示资源严重受限的场景)构建的男性声音,以及使用近10个小时的有声读物和专业语音数据训练的女性声音。主观听觉测试的结果表明,尽管大多数听众都喜欢语音艺术家的声音而不是有声书的声音,但自然的差异并没有被认为是实质性的。结果还表明,在清晰度方面,艺术家的声音优于有声书的声音,尤其是在培训数据有限的情况下。尽管其他培训数据可以提高有声读物语音的清晰度,但结果似乎表明,与大量特别是旧有声书数据相比,少量的专业数据可产生更好的语音。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号