首页> 外国专利> Location and coding of unvoiced plosives in linear predictive coding of speech

Location and coding of unvoiced plosives in linear predictive coding of speech

机译:语音线性预测编码中清浊词的位置和编码

摘要

A method of encoding signal segments which represent unvoiced plosives. The signal segments to be encoded are contained within a speech signal divided into m1, . . . , N frames. Each frame is subdivided into l1, . . . , L subframes. The speech signal has a gain gm(l) within each subframe. An energy measure em(l) representative of the signal segments' energy content is defined. An energy threshold eth(l) representative of a sudden energy change characteristic of an unvoiced plosive is also defined. For each frame, the energy measure em(l) and the energy threshold eth(l) are derived for each subframe within that frame. If em(l)eth(l) for each subframe within a particular frame, then a plosive locator lpl0 and a plosive index ipl0 are assigned to that frame to indicate absence of a plosive within that frame. If em(l)eth(l) for any subframe within the frame, then that frame's plosive locator lpl is assigned a non-zero value, with the plosive locator's value indicating location of the plosive at a transition point immediately following that one of the subframes within the frame for which em(l)eth(l) is greatest; and, that frame's plosive index ipl is assigned a non-zero value representing presence of a plosive within that frame.
机译:一种编码表示清浊音的信号段的方法。待编码的信号段包含在语音信号中,该语音信号被分为m1,..., 。 。 ,N帧。每个帧被细分为。 。 。 L个子帧。语音信号在每个子帧内具有增益g m (l)。定义了代表信号段能量含量的能量度量e m (l)。还定义了代表无声炸药的突然能量变化特征的能量阈值e th (l)。对于每个帧,为该帧内的每个子帧导出能量度量e m (l)和能量阈值e th (l)。如果对于特定帧内的每个子帧e m (l)e th (l),则爆破定位符l pl 0和爆破定位符索引i pl 0被分配给该帧以指示该帧内没有爆破音。如果帧中任何子帧的e m (l)> e (l),则该帧的爆炸定位符l pl 被分配为非零值,爆破定位符的值指示爆破在紧接帧中子帧之一的过渡点处e m (l)e th (l)最大;并且为该帧的爆炸性索引i pl 分配了一个非零值,该值表示该帧中爆炸声的存在。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号