首页> 外国专利> SPEECH RECOGNITION SYSTEM EMPLOYING DISCRIMINATIVELY TRAINED MODELS

SPEECH RECOGNITION SYSTEM EMPLOYING DISCRIMINATIVELY TRAINED MODELS

机译：运用差异化训练模型的语音识别系统

页面导航

摘要
著录项
相似文献

摘要

In the speech recognition system disclosed herein, each input utterance isconverted to a sequence of raw vectors. For each raw vector, the systemidentifies that one of a preselected plurality of quantized vectors which bestmatches the raw vector. The raw vector information is, however, retained forsubsequent utilization. Each model of a vocabulary word to be recognized is inturn represented by sequence of states, the states being selected from apreselected group of states. However, for each word module state, there isprovided both a discrete probability distribution function (pdf) and acontinuous pdf characterized by preselected adjustable parameters. A storedtable is provided which contains distance metric values for each combinationof a quantized input vector with model state as characterized by the discretepdfs. Word models are aligned with an input utterance using the respectivediscrete pdfs and initial match scores are generated using the stored table.From well matching word models identified from the initial match scores, aranked scoring of those models is generated using the respective continuouspdfs and the raw vector information. After each utterance, the preselectedcontinuous pdfs parameters are adjusted to increase, by a small proportion,the difference in scoring between the top and next ranking models. Preferably,if a user corrects a prior recognition event by selecting a different wordmodel from the respective selected group, a re-adjustment of the continuouspdfs parameters is accomplished by performing, on the current state of theparameters, an adjustment opposite to that performed with the originalrecognition event and performing on the then current state of the parametersan adjustment equal to that which would have been performed if the newlyidentified different word model had been the best scoring.

机译：在本文公开的语音识别系统中，每个输入发音是转换为原始向量序列。对于每个原始向量，系统标识最好的预选多个量化向量之一匹配原始向量。但是，原始矢量信息将保留用于后续利用。每个要识别的词汇模型都在由状态序列表示的转弯，这些状态是从预选状态组。但是，对于每个字模块状态，提供了离散概率分布函数（pdf）和连续pdf，具有预选的可调参数。一个存储提供的表格包含每个组合的距离度量值具有离散状态特征的模型状态的量化输入向量PDF文件。使用各自的单词将单词模型与输入话语对齐使用存储的表生成离散的pdf和初始匹配分数。根据从初始匹配分数中识别出的匹配良好的单词模型，这些模型的排名得分是使用相应的连续pdf和原始矢量信息。每次发声后，预先选择调整连续pdf参数以小比例增加排名最高和排名第二的模型之间的得分差异。最好，如果用户通过选择其他单词来纠正先前的识别事件各个选定组的模型，对连续模型进行重新调整pdfs参数是通过对当前状态执行参数，与原始参数相反的调整识别事件并在当时的参数状态下执行调整等于如果新确定不同的单词模型是最佳得分。

著录项

公开/公告号CA2275712A1

专利类型
公开/公告日1998-09-17

原文格式PDF
申请/专利权人 LERNOUT & HAUSPIE SPEECH PRODUCTS N.V.;SEJNOHA VLADIMIR;
展开▼

申请/专利号CA19982275712
发明设计人 SEJNOHA VLADIMIR;
展开▼

申请日1998-03-13
分类号G10L5/06;
国家 CA
入库时间 2022-08-22 02:53:47

相似文献

专利
外文文献
中文文献