首页> 外国专利> FULLY SUPERVISED SPEAKER DIARIZATION

FULLY SUPERVISED SPEAKER DIARIZATION

机译:完全监督扬声器日益改血

摘要

A method includes receiving an utterance of speech and segmenting the utterance of speech into a plurality of segments. For each segment of the utterance of speech, the method also includes extracting a speaker=discriminative embedding from the segment and predicting a probability distribution over possible speakers for the segment using a probabilistic generative model configured to receive the extracted speaker-discriminative embedding as a feature input. The probabilistic generative model trained on a corpus of training speech utterances each segmented into a plurality of training segments. Each training segment including a corresponding speaker-discriminative embedding and a corresponding speaker label. The method also includes assigning a speaker label to each segment of the utterance of speech based on the probability distribution over possible speakers for the corresponding segment.
机译:一种方法包括接收语音的话语并将语音的话语分割成多个段。 对于语音的每个段,该方法还包括从段中提取扬声器=从段中的判别嵌入,并使用被配置为接收提取的扬声器鉴别嵌入作为特征的提取的扬声器鉴别嵌入的可能扬声器的概率分布 输入。 概率的生成模型在训练语料库上培训,每个讲话话语的讲话中被分段为多个训练段。 每个培训段包括相应的扬声器歧视性嵌入和相应的扬声器标签。 该方法还包括将扬声器标签基于对相应段的可能扬声器的概率分布分配给语音的每个段。

著录项

  • 公开/公告号EP3891733A1

    专利类型

  • 公开/公告日2021-10-13

    原文格式PDF

  • 申请/专利权人 GOOGLE LLC;

    申请/专利号EP20190817837

  • 申请日2019-11-12

  • 分类号G10L17/04;G10L17/18;G10L25/87;G10L25/30;

  • 国家 EP

  • 入库时间 2022-08-24 21:39:14

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号