首页> 外国专利> SYSTEMS AND METHODS FOR GENERATING MULTI-LANGUAGE MEDIA CONTENT WITH AUTOMATIC SELECTION OF MATCHING VOICES

SYSTEMS AND METHODS FOR GENERATING MULTI-LANGUAGE MEDIA CONTENT WITH AUTOMATIC SELECTION OF MATCHING VOICES

机译：具有生成多语言媒体内容的系统和方法，具有自动选择匹配的声音

页面导航

摘要
著录项
相似文献

摘要

A method and system for automated voice casting compares candidate voices samples from candidate speakers in a target language with a primary voice sample from a primary speaker in a primary language. Utterances in the audio samples of the candidates speakers and the primary speaker are identified and typed and voice samples generated that meet applicable utterance type criteria. A neural network is used to generate an embedding for the voice samples. A voice sample can include groups of different utterance types and embeddings generated for each utterance group in the voice sample and then combined in a weighted form wherein the resulting embedding emphasizes selected utterance types. Similarities between embeddings for the candidate voice samples relative to the primary voice sample are evaluated and used to select a candidate speaker that is a vocal match.

机译：自动语音铸造的方法和系统将候选语音从候选语言中的候选语言与主要语言中的主语音样本中的候选语言中的样本进行比较。识别候选扬声器和初级扬声器的音频样本中的话语，并键入和产生的语音样本，其符合适用的话语类型标准。神经网络用于生成语音样本的嵌入。语音样本可以包括用于语音样本中的每个话语组的不同话语类型和嵌入的组，然后以加权形式组合，其中产生的嵌入强调所选择的话语类型。评估相对于主语音样本的候选语音样本的嵌入物之间的相似性，并用于选择作为声音匹配的候选扬声器。

著录项

公开/公告号US2021279427A1

专利类型
公开/公告日2021-09-09

原文格式PDF
申请/专利权人 WARNER BROS. ENTERTAINMENT INC.;
展开▼

申请/专利号US202117196285
发明设计人 AANSH MALIK;HA THANH NGUYEN;
展开▼

申请日2021-03-09
分类号G06F40/47;G06F40/58;G10L15/16;G10L15;
国家 US
入库时间 2022-08-24 20:56:04

相似文献

专利
外文文献
中文文献