Computer system realizing unsupervised speaker adaptation of DNN speech synthesis, method and program executed in the computer system


Abstract

The computer system 1 includes a speaker information estimation unit 130 that estimates speaker information of an unknown speaker from that speaker's acoustic features, without requiring text input as teacher (supervision) data. The speaker information includes a speaker code representing the degree of similarity between the distribution of the unknown speaker's acoustic features and the distributions of the acoustic features of a plurality of known speakers. The computer system 1 further includes a synthetic acoustic feature generation unit 220 that uses a multi-speaker acoustic model (DNN) 230 to generate synthesized acoustic features of the unknown speaker based on the linguistic features of input text and the speaker information of the unknown speaker, and a synthetic speech generation unit 240 that generates synthesized speech of the unknown speaker based on those synthesized acoustic features.
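
The idea of a speaker code as a similarity profile over known speakers can be illustrated with a small sketch. The abstract does not say how the similarity is computed, so everything here is an assumption made for illustration: each known speaker is modeled by a diagonal Gaussian over acoustic feature frames, the code is a softmax over the unknown speaker's average log-likelihoods, and the function names (fit_diag_gaussian, estimate_speaker_code, acoustic_model_input) are hypothetical. The multi-speaker DNN itself is not implemented; the sketch only shows that no text is needed to estimate the code, and how the code could be joined with linguistic features to form the model input.

```python
import math

def fit_diag_gaussian(frames):
    """Per-dimension mean and variance of one speaker's acoustic feature frames."""
    n, dim = len(frames), len(frames[0])
    mean = [sum(f[d] for f in frames) / n for d in range(dim)]
    # Variance floor avoids division by zero on tiny toy data.
    var = [max(sum((f[d] - mean[d]) ** 2 for f in frames) / n, 1e-6)
           for d in range(dim)]
    return mean, var

def avg_log_likelihood(frames, model):
    """Average per-frame log-likelihood under a diagonal Gaussian (assumed model)."""
    mean, var = model
    total = 0.0
    for f in frames:
        for x, m, v in zip(f, mean, var):
            total += -0.5 * (math.log(2 * math.pi * v) + (x - m) ** 2 / v)
    return total / len(frames)

def estimate_speaker_code(frames, known_models):
    """Assumed similarity measure: softmax over average log-likelihoods.

    Uses only acoustic frames of the unknown speaker; no transcript is involved.
    """
    scores = [avg_log_likelihood(frames, m) for m in known_models]
    top = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - top) for s in scores]
    z = sum(exps)
    return [e / z for e in exps]

def acoustic_model_input(linguistic_features, speaker_code):
    """Concatenate text-derived features with the speaker code (assumed input layout)."""
    return list(linguistic_features) + list(speaker_code)

# Toy two-dimensional acoustic features for two known speakers.
known_a = [[0.0, 1.0], [0.2, 0.8], [-0.2, 1.2]]
known_b = [[5.0, -1.0], [5.2, -0.8], [4.8, -1.2]]
known_models = [fit_diag_gaussian(known_a), fit_diag_gaussian(known_b)]

# Untranscribed frames from an unknown speaker who sounds close to speaker A.
unknown = [[0.1, 0.9], [0.0, 1.1]]
code = estimate_speaker_code(unknown, known_models)

# Hypothetical linguistic feature vector for one frame of input text.
dnn_input = acoustic_model_input([1.0, 0.0, 1.0], code)
print(code, len(dnn_input))
```

In this toy setup the code has one entry per known speaker and sums to 1, with the larger weight on the known speaker whose feature distribution is closer to the unknown speaker's frames.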

Bibliographic details

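Publication number: JPWO2019044401A1
Patent type:
Publication date: 2019-11-07
Original format: PDF
Applicant/patentee: 大学共同利用機関法人情報・システム研究機構
Application number: JP20180568997
Inventors: 山岸順一; 高木信二
Filing date: 2018-08-06
Classification: G10L13/10
Country: JP
Date indexed: 2022-08-21 12:19:00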
