首页> 外国专利> Text-to-Speech Adapted by Machine Learning

Text-to-Speech Adapted by Machine Learning

机译：机器学习改编的文本到语音

页面导航

摘要
著录项
相似文献

摘要

Machine learned models take in vectors representing desired behaviors and generate voice vectors that provide the parameters for text-to-speech (TTS) synthesis. Models may be trained on behavior vectors that include user profile attributes, situational attributes, or semantic attributes. Situational attributes may include age of people present, music that is playing, location, noise, and mood. Semantic attributes may include presence of proper nouns, number of modifiers, emotional charge, and domain of discourse. TTS voice parameters may apply per utterance and per word as to enable contrastive emphasis.

机译：机器学习的模型接收代表所需行为的向量，并生成语音向量，该向量为文本到语音（TTS）合成提供参数。可以在包括用户档案属性，情境属性或语义属性的行为向量上训练模型。情境属性可能包括在场人员的年龄，正在播放的音乐，位置，噪音和心情。语义属性可能包括专有名词的存在，修饰语的数量，情感冲动和话语范围。 TTS语音参数可能适用于每个发音和每个单词，以实现对比强调。

著录项

公开/公告号US2020151394A1

专利类型
公开/公告日2020-05-14

原文格式PDF
申请/专利权人 SOUNDHOUND INC.;
展开▼

申请/专利号US202016742006
发明设计人 BERNARD MONT-REYNAUD;MONIKA ALMUDAFAR-DEPEYROT;
展开▼

申请日2020-01-14
分类号G06F40/30;G10L13;
国家 US
入库时间 2022-08-21 11:24:57

相似文献

专利
外文文献
中文文献