SPEECH RECOGNITION METHOD OF SELF LEARNING SPEAKER ADAPTATION TYPE

Abstract

PURPOSE: To reduce unsupervised segmentation errors and to facilitate subsequent phone model adaptation by decomposing the sources of acoustic spectral variation and removing the source that degrades recognition performance.

CONSTITUTION: On the training side 10, a spectral bias (h) is subtracted from each speaker's training speech spectrum Xt in the logarithmic domain to produce a set of normalized spectra, which are modeled in process 26 to generate the normalized speaker-independent models M2, M3. The normalized phone models M2, M3 are supplied to a decoder 30 and used to decode the test speech of a speaker (q). Before sentences from the speaker (q) are recognized, a short calibration utterance Xc is supplied to an h-estimator 24, which generates the estimated spectral bias h(q) for that speaker; this bias is subtracted from the speaker's speech spectrum Xt. Subtracting the bias yields the normalized spectrum, which is supplied to the decoder 30 to produce a word string.

COPYRIGHT: (C)1996,JPO
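The bias subtraction described in the abstract can be illustrated with a small sketch. The abstract does not say how the h-estimator 24 computes h(q), so the code below assumes a simple stand-in: the per-bin mean of the calibration log spectra minus a reference mean. The function names `estimate_bias` and `normalize`, the `reference_mean` input, and the toy numbers are all hypothetical and are not taken from the patent.

```python
def estimate_bias(calib_frames, reference_mean):
    """Hypothetical h-estimator (an assumption, not the patent's method):
    per-bin mean of the calibration log spectra Xc minus a reference mean."""
    n_frames = len(calib_frames)
    return [
        sum(frame[k] for frame in calib_frames) / n_frames - reference_mean[k]
        for k in range(len(reference_mean))
    ]


def normalize(frames, bias):
    """Subtract the spectral bias h from every log-spectrum frame.
    A fixed spectral bias is additive in the log domain, so removing it
    is a plain per-bin subtraction."""
    return [[x - h for x, h in zip(frame, bias)] for frame in frames]


# Toy log-spectrum frames with three frequency bins (made-up values).
reference_mean = [1.0, 2.0, 3.0]                  # assumed reference average
calib_xc = [[0.5, 1.75, 5.0], [2.5, 1.75, 3.0]]   # short calibration speech Xc

h_q = estimate_bias(calib_xc, reference_mean)     # estimated bias h(q)
test_frames = [[1.5, 2.75, 3.0]]                  # speech from speaker q
normalized = normalize(test_frames, h_q)          # input to the decoder
```

In the scheme the abstract describes, the same subtraction is applied on the training side before the phone models are built, so the models and the decoder input live in the same normalized space.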

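Bibliographic data

  • Publication number: JPH0863182A

  • Patent type:

  • Publication date: 1996-03-08

  • Original format: PDF

  • Applicant/patentee: MATSUSHITA ELECTRIC IND CO LTD

  • Application number: JP19950206511

  • Inventor: YANKIN TSUAO

  • Filing date: 1995-07-19

  • Classification: G10L3/00; G10L3/02

  • Country: JP

  • Database entry time: 2022-08-22 03:55:35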