首页> 外国专利> FEATURE SPACE TRANSFORMATION FOR PERSONALIZATION USING GENERALIZED I-VECTOR CLUSTERING

FEATURE SPACE TRANSFORMATION FOR PERSONALIZATION USING GENERALIZED I-VECTOR CLUSTERING

机译:基于广义I-向量簇的个性化特征空间变换

摘要

Personalization for Automatic Speech Recognition (ASR) is associated with a particular device. A generalized i-vector clustering method is used to train i-vector parameters on utterances received from a device and to classify test utterances from the same device. A sub-loading matrix and a residual noise term may be used when determining the personalization. A Universal Background Model (UBM) is trained using the utterances. The UBM is applied to obtain i-vectors of training utterances received from a device and a Gaussian Mixture Model (GMM) is trained using the i-vectors. During testing, the i-vector for each utterance received from the device is estimated using the device's UBM. The utterance is then assigned to the cluster with the closest centroid in the GMM. For each utterance, the i-vector and the residual noise estimation is performed. Hyperparameter estimation is also performed. The i-vector estimation and hyperparameter estimation are performed until convergence.
机译:自动语音识别(ASR)的个性化与特定设备相关联。通用的i-vector聚类方法用于训练从设备接收的话语的i-向量参数,并对来自同一设备的测试话语进行分类。当确定个性化时,可以使用子负载矩阵和残余噪声项。使用话语训练通用背景模型(UBM)。应用UBM以获得从设备接收的训练话语的i-向量,并使用i-向量对高斯混合模型(GMM)进行训练。在测试期间,使用设备的UBM估算从设备接收到的每个发音的i矢量。然后,将话语分配给GMM中具有最接近质心的聚类。对于每个话语,执行i矢量和残留噪声估计。还执行超参数估计。进行i矢量估计和超参数估计直到收敛。

著录项

  • 公开/公告号US2014214420A1

    专利类型

  • 公开/公告日2014-07-31

    原文格式PDF

  • 申请/专利权人 MICROSOFT CORPORATION;

    申请/专利号US201313750870

  • 发明设计人 KAISHENG YAO;YIFAN GONG;

    申请日2013-01-25

  • 分类号G10L15/06;

  • 国家 US

  • 入库时间 2022-08-21 16:06:55

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号