首页> 外国专利> Method for reduced computation of t-matrix training for speaker recognition

Method for reduced computation of t-matrix training for speaker recognition

机译：减少扬声器识别的T矩阵训练计算的方法

页面导航

摘要
著录项
相似文献

摘要

A system and method for improving T-matrix training for speaker recognition are provided. The method includes receiving an audio input, divisible into a plurality of audio frames, wherein at least a first audio frame includes an audio sample of a human speaker, the sample having a length above a first threshold; generating for each audio frame a feature vector; generating for a first plurality of feature vectors centered statistics of at least a zero order and a first order; generating a first i-vector, the first i-vector representing the human speaker; generating an optimized T-matrix training sequence computation, based on the first i-vector, an initialized T-matrix, the centered statistics, and a Gaussian mixture model (GMM) of a trained universal background model (UBM).

机译：提供了一种改进扬声器识别的T矩阵训练的系统和方法。该方法包括接收到可分割到多个音频帧中的音频输入，其中至少第一音频帧包括人扬声器的音频样本，样品具有高于第一阈值的长度;为每个音频帧生成特征向量;生成第一多个特征向量以至少零级和第一顺序为中心的统计数据;生成第一I载体，是代表人类扬声器的第一个I形式;基于第一I-Vector，初始化的T矩阵，中心统计和培训的通用背景模型（UBM）的高斯混合模型（GMM）生成优化的T矩阵训练序列计算。

著录项

公开/公告号US10950243B2

专利类型
公开/公告日2021-03-16

原文格式PDF
申请/专利权人 ILLUMA LABS INC.;
展开▼

申请/专利号US201916290399
发明设计人 MILIND BORKAR;
展开▼

申请日2019-03-01
分类号G10L17/04;G10L17/02;
国家 US
入库时间 2022-08-24 17:42:49

相似文献

专利
外文文献
中文文献