首页> 外国专利> Pitch period segmentation of speech signals

Pitch period segmentation of speech signals

机译:语音信号的基音周期分割

摘要

A method for automatic segmentation of pitch periods of speech waveforms takes a speech waveform, a corresponding fundamental frequency contour of the speech waveform, that can be computed by some standard fundamental frequency detection algorithm, and optionally the voicing information of the speech waveform, that can be computed by some standard voicing detection algorithm, as inputs and calculates the corresponding pitch period boundaries of the speech waveform as outputs by iteratively •calculating the Fast Fourier Transform (FFT) of a speech segment having a length of approximately two periods, the period being calculated as the inverse of the mean fundamental frequency associated with these speech segments, •placing the pitch period boundary either at the position where the phase of the third FFT coefficient is −180 degrees, or at the position where the correlation coefficient of two speech segments shifted within the two period long analysis frame maximizes, or at a position calculated as a combination of both measures stated above, and repeatedly shifting the analysis frame one period length further until the end of the speech waveform is reached.
机译:用于自动分割语音波形的基音周期的方法采用语音波形,语音波形的相应基本频率轮廓(可以通过一些标准的基本频率检测算法来计算)以及可选的语音波形的发声信息,该方法可以通过一些标准的语音检测算法进行计算,作为输入,并通过迭代计算语音波形的相应音高周期边界作为输出。•计算长度约为两个周期的语音段的快速傅立叶变换(FFT)计算为与这些语音片段相关的平均基本频率的倒数,•将基音周期边界放置在第三个FFT系数的相位为-180度的位置或两个语音片段的相关系数的位置在两个周期长的分析框架内偏移最大或在位置cal作为上述两种措施的组合进行计算,并进一步将分析帧重复移动一个周期长度,直到达到语音波形的结尾为止。

著录项

  • 公开/公告号US9196263B2

    专利类型

  • 公开/公告日2015-11-24

    原文格式PDF

  • 申请/专利权人 HARALD ROMSDORFER;

    申请/专利号US201013520034

  • 发明设计人 HARALD ROMSDORFER;

    申请日2010-12-29

  • 分类号G10L21/00;G10L25/90;

  • 国家 US

  • 入库时间 2022-08-21 14:29:15

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号