首页> 外国专利> Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these

Method and apparatus for discriminative estimation of parameters in maximum a posteriori (MAP) speaker adaptation condition and voice recognition method and apparatus including these

机译:区分估计最大后验(MAP)说话者适应条件中的参数的方法和设备以及包括这些参数的语音识别方法和设备

摘要

A method and apparatus for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, and a voice recognition apparatus having the apparatus and a voice recognition method using the method are provided. The method for discriminative estimation of parameters in a maximum a posteriori (MAP) speaker adaptation condition, in which at least speaker-independent model parameters and prior density parameters, which are standards in recognizing a speaker's voice, are obtained as the result of model training after fetching training sets on a plurality of speakers from a training database, has the steps of (a) classifying adaptation data among training sets for respective speakers; (b) obtaining model parameters adapted from adaptation data on each speaker by using the initial values of the parameters; (c) searching a plurality of candidate hypotheses on each uttered sentence of training sets by using the adapted model parameters, and calculating gradients of speaker-independent model parameters by measuring the degree of errors on each training sentence; and (d) when training sets of all speakers are adapted, updating parameters, which were set at the initial stage, based on the calculated gradients.
机译:提供了一种用于在最大后验(MAP)说话者适应条件下有区别地估计参数的方法和设备,以及具有该设备和使用该方法的语音识别方法的语音识别设备。用于在最大后验(MAP)说话者适应条件下进行有区别的参数估计的方法,其中至少得到与说话者无关的模型参数和先验密度参数,这是识别说话者声音的标准,是模型训练的结果在从训练数据库中获取多个说话者的训练集之后,具有以下步骤:(a)在各个说话者的训练集之间对适应数据进行分类; (b)通过使用参数的初始值,从每个说话者的适应数据获得适应的模型参数; (c)通过使用调整后的模型参数在训练集的每个发音句子上搜索多个候选假设,并通过测量每个训练句子的错误程度来计算与说话者无关的模型参数的梯度; (d)当所有说话者的训练集都适应时,根据计算出的梯度更新在初始阶段设置的参数。

著录项

  • 公开/公告号US2005065793A1

    专利类型

  • 公开/公告日2005-03-24

    原文格式PDF

  • 申请/专利权人 IN-JEONG CHOI;SANG-RYONG KIM;

    申请/专利号US20040898382

  • 发明设计人 IN-JEONG CHOI;SANG-RYONG KIM;

    申请日2004-07-26

  • 分类号G10L15/12;G10L19/12;

  • 国家 US

  • 入库时间 2022-08-21 22:23:15

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号