Conference: IEEE International Conference on Acoustics, Speech and Signal Processing

Bootstrapping Text-to-Speech for speech processing in languages without an orthography

Abstract

Speech synthesis technology has reached the stage where, given a well-designed corpus of audio and accurate transcription, an at least understandable synthesizer can be built without necessarily resorting to new innovations. However, many languages do not have a well-defined writing system, yet such languages could still benefit greatly from speech systems. In this paper, we consider the case where we have a (potentially large) single-speaker database but no transcriptions and no standardized way to write them. To address this scenario, we propose a method that allows us to bootstrap synthetic voices purely from speech data. We use a novel combination of automatic speech recognition and automatic word segmentation for the bootstrapping. Our experimental results on speech corpora in two languages, English and German, show that synthetic voices built using this method are close to understandable. Our method is language-independent and can thus be used to build synthetic voices from a speech corpus in any new language.
