Corpus-Based Techniques in the AT&T NextGen Synthesis System

Abstract

The AT&T text-to-speech (TTS) synthesis system has been used as a framework for experimenting with a perceptually guided, data-driven approach to speech synthesis, with primary focus on data-driven elements in the "back end". Statistical training techniques applied to a large corpus are used to make decisions about predicted speech events and selected speech inventory units. Our recent advances in automatic phonetic and prosodic labeling, together with new, faster implementations of the harmonic plus noise model (HNM) and of unit preselection, have significantly improved TTS quality and shortened both development time and runtime.
