首页> 外国专利> Formant tracking based on phoneme information

Formant tracking based on phoneme information

机译：基于音素信息的共振峰跟踪

页面导航

摘要
著录项
相似文献

摘要

A method and system for selecting formant trajectories based on input speech and corresponding text data. The input speech is analyzed to obtain formant candidates for the respective time frame. The text data corresponding to the input speech is converted into a sequence of phonemes which are then time aligned such that each phoneme is temporally labeled with a corresponding segment of the input speech. Nominal formant frequencies are assigned to a center timing point of each phoneme and target formant trajectories are generated for each time frame by interpolating the nominal formant frequencies between adjacent phonemes. For each time frame, at least one formant candidate that is closest to the corresponding target formant trajectories is selected according to a minimum cost factor. The selected formant candidates are output for storage or further processing in subsequent speech applications.

机译：一种基于输入语音和对应的文本数据选择共振峰轨迹的方法和系统。分析输入语音以获得相应时间框架的共振峰候选者。对应于输入语音的文本数据被转换成一系列音素，然后将它们进行时间对齐，以使每个音素在时间上都带有对应的输入语音片段。将名义共振峰频率分配给每个音素的中心定时，并通过在相邻音素之间内插名义共振峰频率为每个时间帧生成目标共振峰轨迹。对于每个时间范围，根据最小成本因子选择至少一个最接近相应目标共振峰轨迹的共振峰候选者。所选择的共振峰候选者被输出以在随后的语音应用中存储或进一步处理。

著录项

公开/公告号US6618699B1

专利类型
公开/公告日2003-09-09

原文格式PDF
申请/专利权人 LUCENT TECHNOLOGIES INC.;
展开▼

申请/专利号US19990386037
发明设计人 BERND MOEBIUS;MINKYU LEE;JAN PIETER VAN SANTEN;JOSEPH PHILIP OLIVE;
展开▼

申请日1999-08-30
分类号G10L190/20;
国家 US
入库时间 2022-08-22 00:04:17

相似文献

专利
外文文献
中文文献