
Reconstruction of wideband speech from narrowband speech using codebooks


Abstract

A high-quality wideband speech signal (8 kHz, for example) is reconstructed from a narrowband speech signal (300 Hz to 3.4 kHz). The input narrowband speech signal is LPC-analyzed to obtain spectrum information parameters, and the parameters are vector-quantized using a narrowband speech signal codebook. For each code number of the narrowband speech signal codebook, the wideband speech waveform corresponding to that codevector is extracted in advance, one pitch period for voiced speech and one frame for unvoiced speech, and prestored in a representative waveform codebook. The representative waveform segments corresponding to the codevector numbers output by the quantizer are then read from the representative waveform codebook. Voiced speech is synthesized by pitch-synchronous overlapping of the extracted representative waveform segments, and unvoiced speech is synthesized by randomly using waveforms of one frame length. This produces a wideband speech signal. Finally, the frequency components below 300 Hz and above 3.4 kHz are extracted from this synthesized wideband signal and added to an up-sampled version of the input narrowband speech signal, thereby reconstructing the wideband speech signal.
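The processing chain described in the abstract can be illustrated with a minimal NumPy sketch. It is not the patented implementation. The function names (`quantize`, `overlap_add`, `out_of_band`, `reconstruct`), the 16 kHz sampling rate, the Hann window, and the FFT mask used in place of real band-split filters are all assumptions made for illustration. LPC analysis and up-sampling are assumed to have been done already, every frame is treated as voiced, and all stored segments are assumed to have equal length.

```python
import numpy as np

def quantize(params, nb_codebook):
    """Return the code number of the nearest narrowband codevector (Euclidean)."""
    dists = np.sum((nb_codebook - params) ** 2, axis=1)
    return int(np.argmin(dists))

def overlap_add(segments, hop):
    """Overlap-add Hann-windowed segments placed `hop` samples apart.

    `hop` stands in for the pitch period (assumed constant here).
    """
    seg_len = len(segments[0])
    out = np.zeros(hop * (len(segments) - 1) + seg_len)
    window = np.hanning(seg_len)
    for i, seg in enumerate(segments):
        out[i * hop:i * hop + seg_len] += window * seg
    return out

def out_of_band(signal, fs, lo=300.0, hi=3400.0):
    """Keep only components below `lo` and above `hi` (crude FFT mask)."""
    spec = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / fs)
    spec[(freqs >= lo) & (freqs <= hi)] = 0.0
    return np.fft.irfft(spec, n=len(signal))

def reconstruct(nb_params, nb_upsampled, nb_codebook, waveform_codebook,
                hop, fs=16000):
    """Look up wideband segments per frame, synthesize, and add the
    out-of-band part to the up-sampled narrowband signal."""
    indices = [quantize(p, nb_codebook) for p in nb_params]
    segments = [waveform_codebook[i] for i in indices]
    synth = overlap_add(segments, hop)
    n = min(len(synth), len(nb_upsampled))
    return nb_upsampled[:n] + out_of_band(synth[:n], fs)

# Toy data only: random codebooks, not trained on speech.
rng = np.random.default_rng(0)
nb_codebook = rng.normal(size=(4, 10))         # 4 codevectors, 10 parameters each
waveform_codebook = rng.normal(size=(4, 160))  # one wideband segment per code number
nb_params = nb_codebook[[2, 0, 3]] + 0.01      # three frames near known codevectors
nb_upsampled = np.zeros(400)                   # placeholder narrowband signal
wideband = reconstruct(nb_params, nb_upsampled, nb_codebook,
                       waveform_codebook, hop=80)
```

Because the narrowband placeholder is all zeros in this toy run, `wideband` contains only the components added outside 300 Hz to 3.4 kHz. A practical system would use proper filters rather than an FFT mask, and would handle unvoiced frames separately.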
