首页> 外国专利> TEXT-TO-SPEECH FROM MEDIA CONTENT ITEM SNIPPETS

TEXT-TO-SPEECH FROM MEDIA CONTENT ITEM SNIPPETS

机译：来自媒体内容项片段的文本到语音

页面导航

摘要
著录项
相似文献

摘要

A text-to-speech engine creates audio output that includes synthesized speech and one or more media content item snippets. The input text is obtained and partitioned into text sets. A track having lyrics that match a part of one of the text sets is identified. The location of the track's audio that contains the lyric is extracted based on forced alignment data. The extracted audio is combined with synthesized speech corresponding to the remainder of the input text to form audio output.

机译：文本到语音引擎创建音频输出，包括合成语音和一个或多个媒体内容项片段。获取输入文本并将其分区为文本集。识别具有符合其中一个文本集的歌词的曲目。基于强制对准数据提取包含歌词的轨道音频的位置。提取的音频与对应于输入文本的其余部分的合成语音组合以形成音频输出。

著录项

公开/公告号EP3872806A1

专利类型
公开/公告日2021-09-01

原文格式PDF
申请/专利权人 SPOTIFY AB;
展开▼

申请/专利号EP20210167170
发明设计人 KUMAR ROHIT;LINDSTRÖM HENRIK;CRAMER HENRIETTE;MENNICKEN SARAH;REDDY SRAVANA;THOM-SANTELLI JENNIFER;
展开▼

申请日2019-12-09
分类号G10L13/06;G10L13/02;
国家 EP
入库时间 2022-08-24 22:22:28

相似文献

专利
外文文献
中文文献