首页> 外国专利> Computer system realizing unsupervised speaker adaptation of DNN speech synthesis, method and program executed in the computer system

Computer system realizing unsupervised speaker adaptation of DNN speech synthesis, method and program executed in the computer system

机译:实现DNN语音合成的无监督说话者自适应的计算机系统,在该计算机系统中执行的方法和程序

摘要

The computer system 1 includes a speaker information estimation unit 130 that estimates speaker information of an unknown speaker based on the acoustic feature amount of the unknown speaker without requiring input of text as teacher data. The speaker information of the unknown speaker includes a speaker code that represents the degree of similarity between the distribution of acoustic feature amounts of the unknown speaker and the distribution of acoustic feature amounts of a plurality of known speakers. The computer system 1 uses a multi-speaker acoustic model (DNN) 230 to synthesize acoustic features of an unknown speaker based on the language feature amount of the input text and the speaker information of the unknown speaker. A synthetic acoustic feature generation unit 220 that generates a quantity and a synthetic speech generation unit 240 that generates a synthesized voice of the unknown speaker based on the synthesized acoustic feature quantity of the unknown speaker.
机译:计算机系统1包括说话者信息估计单元130,该说话者信息估计单元130基于未知说话者的声学特征量来估计未知说话者的说话者信息,而不需要输入文本作为教师数据。未知扬声器的扬声器信息包括扬声器代码,该扬声器代码表示未知扬声器的声学特征量的分布与多个已知扬声器的声学特征量的分布之间的相似度。计算机系统1使用多扬声器声学模型(DNN)230基于输入文本的语言特征量和未知扬声器的扬声器信息来合成未知扬声器的声学特征。生成数量的合成声学特征生成单元220和基于未知说话者的合成声学特征量生成未知说话者的合成语音的合成语音生成单元240。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号