Spectral phase modeling of the prototype waveform components for a frequency domain interpolative speech codec system

Abstract

Encoding of prototype waveform components, applicable to GeoMobile and Telephony Earth Station (TES) systems, that provides improved voice quality and enables a dual-channel mode of operation in which more users can communicate over the same physical channel. The prototype waveform (PW) gain is vector quantized using a vector quantizer (VQ) whose codebook is explicitly populated with representative steady-state and transient PW gain vectors, so that abrupt variations in speech level during onsets and other non-stationary events are tracked while the accuracy of the speech level is maintained during stationary conditions. The rapidly evolving waveform (REW) and slowly evolving waveform (SEW) component vectors are converted to magnitude-phase form. The variable-dimension SEW magnitude vector is quantized using a hierarchical approach, i.e., a fixed-dimension SEW mean vector computed by sub-band averaging of the SEW magnitude spectrum, and only the REW magnitude is explicitly encoded. The REW magnitude vector sequence is normalized to unity RMS value, resulting in a REW magnitude shape vector and a REW gain vector. The normalized REW magnitude vectors are modeled by a multi-band sub-band model which converts the variable-dimension REW magnitude shape vectors to fixed-dimension vectors, e.g., six-dimensional REW sub-band vectors. The sub-band vectors are averaged over time, resulting in a single average REW sub-band vector for each frame. At the decoder, the full-dimension REW magnitude shape vector is obtained from the REW sub-band vector by a piecewise-constant construction. The REW phase vector is regenerated at the decoder based on the received REW gain vector and the voicing measure, which determines a weighted mixture of the SEW component and random noise that is passed through a high-pass filter to generate the REW component. The high-pass filter poles are adjusted based on the voicing measure to control the REW component characteristics. At the output of the filter, the magnitude of the REW component is scaled to match the received REW magnitude vector.
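The REW magnitude path described in the abstract (unit-RMS normalization, sub-band averaging, averaging over a frame, and piecewise-constant reconstruction at the decoder) can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names are hypothetical, and the uniform band split in `band_edges` is an assumption, since the abstract does not state where the sub-band boundaries lie.

```python
import math

def normalize_to_unit_rms(magnitudes):
    """Split a REW magnitude vector into (gain, shape); shape has unit RMS."""
    rms = math.sqrt(sum(m * m for m in magnitudes) / len(magnitudes))
    if rms == 0.0:
        return 0.0, [0.0] * len(magnitudes)
    return rms, [m / rms for m in magnitudes]

def band_edges(num_harmonics, num_bands=6):
    """ASSUMPTION: uniform split of the harmonics into bands.
    The abstract does not specify the actual band edges."""
    if num_harmonics < num_bands:
        raise ValueError("need at least one harmonic per band")
    return [b * num_harmonics // num_bands for b in range(num_bands + 1)]

def to_subbands(shape, num_bands=6):
    """Map a variable-dimension shape vector to a fixed-dimension vector
    by averaging the magnitudes inside each band."""
    edges = band_edges(len(shape), num_bands)
    return [sum(shape[lo:hi]) / (hi - lo) for lo, hi in zip(edges, edges[1:])]

def average_over_frame(subband_vectors):
    """Average the sub-band vectors of one frame into a single vector."""
    count = len(subband_vectors)
    return [sum(column) / count for column in zip(*subband_vectors)]

def reconstruct_shape(subband_vector, num_harmonics):
    """Decoder side: piecewise-constant expansion back to full dimension."""
    edges = band_edges(num_harmonics, len(subband_vector))
    shape = []
    for value, lo, hi in zip(subband_vector, edges, edges[1:]):
        shape.extend([value] * (hi - lo))
    return shape

# Usage with made-up magnitudes for a prototype waveform with 12 harmonics.
rew_magnitudes = [0.2, 0.4, 0.4, 0.6, 0.8, 0.8, 1.0, 1.2, 1.2, 1.4, 1.6, 1.6]
gain, shape = normalize_to_unit_rms(rew_magnitudes)
subbands = to_subbands(shape)
rebuilt = reconstruct_shape(subbands, len(rew_magnitudes))
```

Because the number of harmonics changes with pitch, consecutive shape vectors in a frame can have different lengths. Mapping each one to a fixed number of sub-bands first is what makes the per-frame averaging well defined.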
