首页> 外国专利> SPEECH RECOGNITION METHOD OF SELF LEARNING SPEAKER ADAPTATION TYPE

SPEECH RECOGNITION METHOD OF SELF LEARNING SPEAKER ADAPTATION TYPE

机译：自学说话人自适应类型的语音识别方法

页面导航

摘要
著录项
相似文献

摘要

PURPOSE: To reduce unsupervised segmentation error and to facilitate a succeeding phone model adaptive execution by eliminating an acoustic spectrum fluctuation source casing recognition performance deterioration by decomposing the spectrum fluctuation source. ;CONSTITUTION: In a training side 10, a spectrum bias (h) is subtracted from a training speech spectrum X_t of the speaker in a logarithmic domain to generate a set of a normalized spectrum, and is made into a model in a process 26 to generate the models M2, M3 of a normalized unspecified speaker. The normalized phone models M2, M3 are supplied to a decoder 30, and are used for decoding the test speech of the speaker (q). Before the speaker (q) recognized a sentence, short generation of a proofreading speech X_c is supplied to an h- estimater 24, and the estimated spectrum bias h^(q) for speaker is generated, and it is subtracted from the training speech spectrum X_t. A bias parameter generates the normalized spectrum, and the normalized spectrum is supplied to the decoder 30 to constitute a word line.;COPYRIGHT: (C)1996,JPO

机译：目的：通过消除频谱波动源，消除声学频谱波动源机壳识别性能的下降，以减少无监督的分割错误并促进后续电话模型的自适应执行。 ;构成：在训练侧10中，从对数域中说话者的训练语音频谱X _{t 中减去频谱偏差（h），以生成一组归一化频谱，并使其在过程26中将其转化为模型以生成归一化未指定说话者的模型M2，M3。归一化的电话模型M2，M3被提供给解码器30，并且被用于解码说话者（q）的测试语音。在说话者（q）识别句子之前，将简短的校对语音X _{c 的生成提供给h估计器24，并且估计的频谱偏差h ^{（q）生成说话人的语音，并从训练语音频谱X _{t 中减去。偏置参数产生归一化频谱，并且将归一化频谱提供给解码器30以构成字线。版权所有：（C）1996，JPO}}}}

著录项

公开/公告号JPH0863182A

专利类型
公开/公告日1996-03-08

原文格式PDF
申请/专利权人 MATSUSHITA ELECTRIC IND CO LTD;
展开▼

申请/专利号JP19950206511
发明设计人 YANKIN TSUAO;
展开▼

申请日1995-07-19
分类号G10L3/00;G10L3/02;
国家 JP
入库时间 2022-08-22 03:55:35

相似文献

专利
外文文献
中文文献