首页> 外国专利> FULLY SUPERVISED SPEAKER DIARIZATION

FULLY SUPERVISED SPEAKER DIARIZATION

机译：完全监督扬声器日益改血

页面导航

摘要
著录项
相似文献

摘要

A method includes receiving an utterance of speech and segmenting the utterance of speech into a plurality of segments. For each segment of the utterance of speech, the method also includes extracting a speaker=discriminative embedding from the segment and predicting a probability distribution over possible speakers for the segment using a probabilistic generative model configured to receive the extracted speaker-discriminative embedding as a feature input. The probabilistic generative model trained on a corpus of training speech utterances each segmented into a plurality of training segments. Each training segment including a corresponding speaker-discriminative embedding and a corresponding speaker label. The method also includes assigning a speaker label to each segment of the utterance of speech based on the probability distribution over possible speakers for the corresponding segment.

机译：一种方法包括接收语音的话语并将语音的话语分割成多个段。对于语音的每个段，该方法还包括从段中提取扬声器=从段中的判别嵌入，并使用被配置为接收提取的扬声器鉴别嵌入作为特征的提取的扬声器鉴别嵌入的可能扬声器的概率分布输入。概率的生成模型在训练语料库上培训，每个讲话话语的讲话中被分段为多个训练段。每个培训段包括相应的扬声器歧视性嵌入和相应的扬声器标签。该方法还包括将扬声器标签基于对相应段的可能扬声器的概率分布分配给语音的每个段。

著录项

公开/公告号EP3891733A1

专利类型
公开/公告日2021-10-13

原文格式PDF
申请/专利权人 GOOGLE LLC;
展开▼

申请/专利号EP20190817837
发明设计人 WANG CHONG;ZHANG AONAN;WANG QUAN;ZHU ZHENYAO;
展开▼

申请日2019-11-12
分类号G10L17/04;G10L17/18;G10L25/87;G10L25/30;
国家 EP
入库时间 2022-08-24 21:39:14

相似文献

专利
外文文献
中文文献