Journal of Beijing Institute of Technology

Variable Rate Characteristic Waveform Interpolation Speech Coder Based on Phonetic Classification

Abstract

A variable-bit-rate characteristic waveform interpolation (VBR-CWI) speech codec is proposed that integrates phonetic classification into characteristic waveform (CW) decomposition and operates at an average bit rate of about 1.8 kbit/s. Each input frame is classified into one of four phonetic classes. Non-speech frames are represented with a Bark-band noise model. For unvoiced frames the extracted CWs are treated entirely as rapidly evolving waveforms (REWs), and for stationary voiced frames entirely as slowly evolving waveforms (SEWs); mixed voiced frames use the same CW decomposition as conventional CWI. Experimental results show that the proposed codec eliminates most of the buzzy and noisy artifacts present in the fixed-bit-rate characteristic waveform interpolation (FBR-CWI) speech codec at a much lower average bit rate, and that its reconstructed speech quality is much better than that of FS1016 CELP at 4.8 kbit/s and similar to that of G.723.1 ACELP at 5.3 kbit/s.
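
The class-dependent decomposition described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The names `FrameClass`, `moving_average`, and `decompose` are hypothetical, and the 3-tap moving average only stands in for the low-pass filter along the evolution axis that conventional CWI uses to obtain the SEW (with the REW as the remainder). The abstract does not give the filter, the classifier, or the Bark-band noise model, so none of them is reproduced here.

```python
from enum import Enum
from typing import List, Tuple

Waveform = List[float]  # one characteristic waveform, sampled over one pitch cycle


class FrameClass(Enum):
    # The four phonetic classes named in the abstract (labels are hypothetical).
    NON_SPEECH = 0
    UNVOICED = 1
    MIXED_VOICED = 2
    STATIONARY_VOICED = 3


def moving_average(cws: List[Waveform], taps: int = 3) -> List[Waveform]:
    """Smooth the CW sequence along the evolution (time) axis.

    Assumption: a short moving average stands in for the real low-pass filter.
    """
    half = taps // 2
    out = []
    for i in range(len(cws)):
        window = cws[max(0, i - half): i + half + 1]
        # Average each phase position across the neighbouring CWs.
        out.append([sum(col) / len(window) for col in zip(*window)])
    return out


def decompose(cws: List[Waveform], frame_class: FrameClass) -> Tuple[List[Waveform], List[Waveform]]:
    """Return (SEW, REW) for the CWs of one frame, depending on its class."""
    zeros = [[0.0] * len(cw) for cw in cws]
    copy = [list(cw) for cw in cws]
    if frame_class is FrameClass.NON_SPEECH:
        # Assumption: non-speech frames bypass CW decomposition entirely
        # (the abstract models them with Bark-band noise, not sketched here).
        return [], []
    if frame_class is FrameClass.UNVOICED:
        return zeros, copy          # CWs become REWs
    if frame_class is FrameClass.STATIONARY_VOICED:
        return copy, zeros          # CWs become SEWs
    # Mixed voiced: conventional split, SEW = low-passed CW, REW = CW - SEW.
    sew = moving_average(cws)
    rew = [[c - s for c, s in zip(cw, sw)] for cw, sw in zip(cws, sew)]
    return sew, rew
```

In this sketch SEW + REW reconstructs the input CWs for every speech class, so the classification only changes how the signal is divided between the two components. One plausible reading of the abstract is that the bit-rate saving comes from not having to code the component that is empty for a given class.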