Location and coding of unvoiced plosives in linear predictive coding of speech

Abstract

A method of encoding signal segments which represent unvoiced plosives. The signal segments to be encoded are contained within a speech signal divided into frames m = 1, ..., N. Each frame is subdivided into subframes l = 1, ..., L. The speech signal has a gain g_m(l) within each subframe. An energy measure e_m(l) representative of the signal segments' energy content is defined. An energy threshold e_th(l) representative of a sudden energy change characteristic of an unvoiced plosive is also defined. For each frame, the energy measure e_m(l) and the energy threshold e_th(l) are derived for each subframe within that frame. If e_m(l) ≤ e_th(l) for each subframe within a particular frame, then a plosive locator l_pl = 0 and a plosive index i_pl = 0 are assigned to that frame to indicate absence of a plosive within that frame. If e_m(l) > e_th(l) for any subframe within the frame, then that frame's plosive locator l_pl is assigned a non-zero value, with the plosive locator's value indicating location of the plosive at a transition point immediately following that one of the subframes within the frame for which e_m(l) - e_th(l) is greatest; and that frame's plosive index i_pl is assigned a non-zero value representing presence of a plosive within that frame.
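The per-frame decision described in the abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the abstract does not say how the energy measure or the threshold are computed, so both are taken here as precomputed per-subframe inputs. The function name `locate_plosive`, the use of the 1-based subframe number as the locator value, and the value 1 for the plosive index are hypothetical choices made for the example, not details taken from the patent.

```python
from typing import Sequence, Tuple


def locate_plosive(
    energy: Sequence[float], threshold: Sequence[float]
) -> Tuple[int, int]:
    """Return (l_pl, i_pl) for one frame.

    energy[k] and threshold[k] hold e_m(l) and e_th(l) for subframe l = k + 1.
    How these values are derived is outside this sketch (assumed given).
    """
    if len(energy) != len(threshold):
        raise ValueError("energy and threshold must cover the same subframes")

    best_subframe = 0   # 0 means "no subframe exceeded its threshold"
    best_excess = 0.0   # largest e_m(l) - e_th(l) seen so far

    for l, (e, e_th) in enumerate(zip(energy, threshold), start=1):
        excess = e - e_th
        # Only subframes with e_m(l) > e_th(l) can qualify; among those,
        # keep the one with the greatest excess (first one wins on ties).
        if excess > best_excess:
            best_subframe = l
            best_excess = excess

    if best_subframe == 0:
        # e_m(l) <= e_th(l) for every subframe: no plosive in this frame.
        return 0, 0

    # Assumption: the locator is the 1-based number of the subframe after
    # which the plosive transition occurs, and the index is a plain flag.
    return best_subframe, 1
```

For instance, with subframe energies `[1.0, 5.0, 3.0, 9.0]` and thresholds `[2.0, 2.0, 2.0, 2.5]`, the fourth subframe has the greatest excess, so the sketch places the plosive at the transition following subframe 4.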