首页>
外国专利>
SPEECH RECOGNITION METHOD OF SELF LEARNING SPEAKER ADAPTATION TYPE
SPEECH RECOGNITION METHOD OF SELF LEARNING SPEAKER ADAPTATION TYPE
展开▼
机译:自学说话人自适应类型的语音识别方法
展开▼
页面导航
摘要
著录项
相似文献
摘要
PURPOSE: To reduce unsupervised segmentation error and to facilitate a succeeding phone model adaptive execution by eliminating an acoustic spectrum fluctuation source casing recognition performance deterioration by decomposing the spectrum fluctuation source. ;CONSTITUTION: In a training side 10, a spectrum bias (h) is subtracted from a training speech spectrum Xt of the speaker in a logarithmic domain to generate a set of a normalized spectrum, and is made into a model in a process 26 to generate the models M2, M3 of a normalized unspecified speaker. The normalized phone models M2, M3 are supplied to a decoder 30, and are used for decoding the test speech of the speaker (q). Before the speaker (q) recognized a sentence, short generation of a proofreading speech Xc is supplied to an h- estimater 24, and the estimated spectrum bias h(q) for speaker is generated, and it is subtracted from the training speech spectrum Xt. A bias parameter generates the normalized spectrum, and the normalized spectrum is supplied to the decoder 30 to constitute a word line.;COPYRIGHT: (C)1996,JPO
展开▼
机译:目的:通过消除频谱波动源,消除声学频谱波动源机壳识别性能的下降,以减少无监督的分割错误并促进后续电话模型的自适应执行。 ;构成:在训练侧10中,从对数域中说话者的训练语音频谱X t Sub>中减去频谱偏差(h),以生成一组归一化频谱,并使其在过程26中将其转化为模型以生成归一化未指定说话者的模型M2,M3。归一化的电话模型M2,M3被提供给解码器30,并且被用于解码说话者(q)的测试语音。在说话者(q)识别句子之前,将简短的校对语音X c Sub>的生成提供给h估计器24,并且估计的频谱偏差h (q) Sup>生成说话人的语音,并从训练语音频谱X t Sub>中减去。偏置参数产生归一化频谱,并且将归一化频谱提供给解码器30以构成字线。版权所有:(C)1996,JPO
展开▼