首页> 外国专利> Formant tracking based on phoneme information

Formant tracking based on phoneme information

机译:基于音素信息的共振峰跟踪

摘要

A method and system for selecting formant trajectories based on input speech and corresponding text data. The input speech is analyzed to obtain formant candidates for the respective time frame. The text data corresponding to the input speech is converted into a sequence of phonemes which are then time aligned such that each phoneme is temporally labeled with a corresponding segment of the input speech. Nominal formant frequencies are assigned to a center timing point of each phoneme and target formant trajectories are generated for each time frame by interpolating the nominal formant frequencies between adjacent phonemes. For each time frame, at least one formant candidate that is closest to the corresponding target formant trajectories is selected according to a minimum cost factor. The selected formant candidates are output for storage or further processing in subsequent speech applications.
机译:一种基于输入语音和对应的文本数据选择共振峰轨迹的方法和系统。分析输入语音以获得相应时间框架的共振峰候选者。对应于输入语音的文本数据被转换成一系列音素,然后将它们进行时间对齐,以使每个音素在时间上都带有对应的输入语音片段。将名义共振峰频率分配给每个音素的中心定时,并通过在相邻音素之间内插名义共振峰频率为每个时间帧生成目标共振峰轨迹。对于每个时间范围,根据最小成本因子选择至少一个最接近相应目标共振峰轨迹的共振峰候选者。所选择的共振峰候选者被输出以在随后的语音应用中存储或进一步处理。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号