Training and search methods for speech recognition.

Abstract

Speech recognition involves three processes: extraction of acoustic indices from the speech signal, estimation of the probability that the observed index string was caused by a hypothesized utterance segment, and determination of the recognized utterance via a search among hypothesized alternatives. This paper is not concerned with the first process. Estimation of the probability of an index string involves a model of index production by any given utterance segment (e.g., a word). Hidden Markov models (HMMs) are used for this purpose [Makhoul, J. & Schwartz, R. (1995) Proc. Natl. Acad. Sci. USA 92, 9956-9963]. Their parameters are state transition probabilities and output probability distributions associated with the transitions. The Baum algorithm that obtains the values of these parameters from speech data via their successive reestimation will be described in this paper. The recognizer wishes to find the most probable utterance that could have caused the observed acoustic index string. That probability is the product of two factors: the probability that the utterance will produce the string and the probability that the speaker will wish to produce the utterance (the language model probability). Even if the vocabulary size is moderate, it is impossible to search for the utterance exhaustively. One practical algorithm is described [Viterbi, A. J. (1967) IEEE Trans. Inf. Theory IT-13, 260-267] that, given the index string, has a high likelihood of finding the most probable utterance.
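The search step mentioned in the abstract can be illustrated with a minimal Viterbi sketch. It assumes a small discrete HMM whose output distributions are attached to transitions, as the abstract describes. The function name `viterbi`, the `TRANS` table layout, the two-state toy model, and the symbols "a" and "b" are hypothetical choices made for illustration; they are not taken from the paper. The sketch scores a single model with no language model and no pruning, so it shows only the dynamic-programming core, not a full recognizer.

```python
import math

NEG_INF = float("-inf")

# Hypothetical toy model: TRANS[(i, j)] = (transition probability,
# output distribution attached to the transition i -> j).
TRANS = {
    (0, 0): (0.6, {"a": 0.9, "b": 0.1}),
    (0, 1): (0.4, {"a": 0.2, "b": 0.8}),
    (1, 1): (0.7, {"a": 0.1, "b": 0.9}),
    (1, 0): (0.3, {"a": 0.5, "b": 0.5}),
}


def viterbi(observations, n_states, trans, initial_state=0):
    """Return (state path, log-probability) of the single best path.

    The path lists the initial state followed by one state per observation.
    Returns (None, -inf) if no path can produce the observations.
    """
    # best[s] = log-probability of the best partial path ending in state s
    best = [NEG_INF] * n_states
    best[initial_state] = 0.0
    backpointers = []

    for symbol in observations:
        new_best = [NEG_INF] * n_states
        back = [None] * n_states
        for (i, j), (p_trans, outputs) in trans.items():
            p_out = outputs.get(symbol, 0.0)
            if best[i] == NEG_INF or p_trans <= 0.0 or p_out <= 0.0:
                continue  # this transition cannot extend any live path
            score = best[i] + math.log(p_trans) + math.log(p_out)
            if score > new_best[j]:
                new_best[j] = score
                back[j] = i
        best = new_best
        backpointers.append(back)

    # Pick the best final state, then follow backpointers to the start.
    end = max(range(n_states), key=lambda s: best[s])
    if best[end] == NEG_INF:
        return None, NEG_INF
    path = [end]
    for back in reversed(backpointers):
        path.append(back[path[-1]])
    path.reverse()
    return path, best[end]


path, logp = viterbi(["a", "b", "b"], 2, TRANS)
print(path, math.exp(logp))
```

Log-probabilities are used instead of raw products so that long observation strings do not underflow. In a recognizer, a language model score would typically be added to the path score at word boundaries.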

Bibliographic record

Journal: Proceedings of the National Academy of Sciences of the United States of America
Author: F. Jelinek
Year (volume), issue: 1995 (92), 22
Pages: 9964–9969
Total pages: 6
Original format: PDF

Similar documents

1. Training and Search Methods for Speech Recognition [J]. Jelinek, F. Proceedings of the National Academy of Sciences of the United States of America. 1995, No. 22.

2. The Influence of Voice Volume, Pitch, and Speech Rate on Progressive Relaxation Training: Application of Methods from Speech Pathology and Audiology [J]. Glenn E. Knowlton, Kevin T. Larkin. Applied Psychophysiology and Biofeedback. 2006, No. 2.

3. Do We Need STRFs for Cocktail Parties? On the Relevance of Physiologically Motivated Features for Human Speech Perception Derived from Automatic Speech Recognition [J]. B Kollmeier, M R René Schädler, A Meyer. Advances in Experimental Medicine and Biology. 2013.

4. Contextual modeling of hand written Chinese character for recognition. II. Discriminative training [C]. Yan Xiong, Qiang Huo. 1997.

5. Confidence measures as a search guide in speech recognition [D]. Abdou, Sherif Mahdy. 2003.

6. Applying systematic review search methods to the grey literature: a review of education and training courses on breastfeeding support for health professionals [O]. Ivette Navarro, Jose M. Soriano, Salomé Laredo. 2021.

7. Training and search methods for speech recognition [O]. Jelinek, F. 1995.

8. Progressive-Search Algorithms for Large-Vocabulary Speech Recognition [R]. Murveit, H., Butzberger, J., Digalakis, V. 1993.
