首页> 外国专利> System and method for automatic speech recognition from phonetic features and acoustic landmarks

System and method for automatic speech recognition from phonetic features and acoustic landmarks

机译：从语音特征和声学界标自动语音识别的系统和方法

页面导航

摘要
著录项
相似文献

摘要

A probabilistic framework for acoustic-phonetic automatic speech recognition organizes a set of phonetic features into a hierarchy consisting of a broad manner feature sub-hierarchy and a fine phonetic feature sub-hierarchy. Each phonetic feature of said hierarchy corresponds to a set of acoustic correlates and each broad manner feature of said broad manner feature sub-hierarchy is further associated with a corresponding set of acoustic landmarks. A pattern recognizer is trained from a knowledge base of phonetic features and corresponding acoustic correlates. Acoustic correlates are extracted from a speech signal and are presented to the pattern recognizer. Acoustic landmarks are identified and located from broad manner classes classified by the pattern recognizer. Fine phonetic features are determined by the pattern recognizer at and around the acoustic landmarks. The determination of fine phonetic features may be constrained by a pronunciation model. The most probable feature bundles corresponding to words and sentences are those that maximize the joint a posteriori probability of the fine phonetic features and corresponding acoustic landmarks. When the hierarchy is organized as a binary tree, binary classifiers such as Support Vector Machines can be used in the pattern classifier and the outputs thereof can be converted probability measures which, in turn may be used in the computation of the aforementioned joint probability of fine phonetic features and corresponding landmarks.

机译：用于语音自动语音识别的概率框架将一组语音特征组织到一个层次结构中，该层次结构由广泛的方式特征子层次结构和精细的语音特征子层次结构组成。所述层次结构的每个语音特征对应于一组声学相关性，并且所述宽泛方式特征子层次结构的每个宽泛方式特征还与一组对应的声学界标相关联。模式识别器是从语音特征和相应的声学关联的知识库中训练出来的。从语音信号中提取声学相关，并将其呈现给模式识别器。从模式识别器分类的广泛方式类别中识别和定位声音界标。精细的语音特征是由模式识别器在声学界标处及其周围确定的。良好的语音特征的确定可能受发音模型的约束。与单词和句子相对应的最可能的特征束是那些最大化精细语音特征和相应的声音界标的后验概率联合的特征束。当层次结构被组织为二叉树时，可以在模式分类器中使用诸如支持向量机之类的二进制分类器，并且可以将其输出转换为概率度量，该概率度量又可以用于计算前述的联合罚款概率。语音特征和相应的地标。

著录项

公开/公告号US2006212296A1

专利类型
公开/公告日2006-09-21

原文格式PDF
申请/专利权人 CAROL ESPY-WILSON;AMIT JUNEJA;
展开▼

申请/专利号US20050081507
发明设计人 CAROL ESPY-WILSON;AMIT JUNEJA;
展开▼

申请日2005-03-17
分类号G10L15/04;
国家 US
入库时间 2022-08-21 21:46:42

相似文献

专利
外文文献
中文文献