首页> 外国专利> Method of speech recognition using variables representing dynamic aspects of speech

Method of speech recognition using variables representing dynamic aspects of speech

机译：使用代表语音动态方面的变量的语音识别方法

页面导航

摘要
著录项
相似文献

摘要

A method and computer-readable medium are provided that determine predicted acoustic values for a sequence of hypothesized speech units using modeled articulatory or VTR dynamics values and using the modeled relationship between the articulatory (or VTR) and acoustic values for the same speech events. Under one embodiment, the articulatory (or VTR) dynamics value depends on articulatory dynamics values at pervious time frames and articulation targets. In another embodiment, the articulatory dynamics value depends in part on an acoustic environment value such as noise or distortion. In a third embodiment, a time constant that defines the articulatory dynamics value is trained using a variety of articulation styles. By modeling the articulatory or VTR dynamics value in these manners, hyper-articulated, hypo-articulated, fast, and slow speech can be better recognized and the requirement for the training data can be reduced.

机译：提供了一种方法和计算机可读介质，其使用建模的发音或VTR动态值并针对相同的语音事件使用发音（或VTR）和声学值之间的建模关系来确定一系列假定语音单元的预测声学值。在一个实施例中，关节运动（或VTR）动力学值取决于在先前时间帧和关节运动目标处的关节运动动力学值。在另一个实施例中，发音动态值部分地取决于声学环境值，例如噪声或失真。在第三实施例中，使用多种发音方式来训练定义发音动态值的时间常数。通过以这些方式对发音或VTR动力学值进行建模，可以更好地识别超发音，低发音，快和慢的语音，并且可以减少对训练数据的需求。

著录项

公开/公告号US2003182110A1

专利类型
公开/公告日2003-09-25

原文格式PDF
申请/专利权人 DENG LI;
展开▼

申请/专利号US20020100717
发明设计人 LI DENG;
展开▼

申请日2002-03-19
分类号G10L15/00;
国家 US
入库时间 2022-08-22 00:09:57

相似文献

专利
外文文献
中文文献