【24h】

When will synthetic speech sound human: Role of rules and data

机译:人工语音何时会发出声音:规则和数据的作用

获取原文

摘要

Text-to-speech synthesis research has moved away from building general purpose systems based on an understanding of human language and speech production towards building systems based on statistical algorithms applied to large text and speech corpora, and, recently, towards building such systems for specific domains. Despite substantial progress, the overall quality of even the best systems is often still inadequate for broad user acceptance in applications that cannot also be handled with simple phrase splicing. This tutorial paper analyzes which problems must be addressed to achieve the goal of generating natural-sounding speech in limited domains in a cost-effective way, and the roles of data and rules as we work towards solutions.
机译:文本到语音的综合研究已从基于对人类语言和语音产生的理解而构建通用系统,转向基于适用于大型文本和语音语料库的统计算法的构建系统,近来已针对特定的语言构建此类系统。域。尽管取得了长足的进步,但即使是最好的系统,其整体质量也常常不足以使用户无法通过简单的短语拼接来处理的应用程序中获得广泛的用户认可。本教程分析了必须解决的问题,以实现以成本有效的方式在有限的范围内生成自然声音语音的目标,以及我们在寻求解决方案时数据和规则的作用。

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号