首页> 外文OA文献 >New linear predictive methods for digital speech processing
【2h】

New linear predictive methods for digital speech processing

机译:数字语音处理的新线性预测方法

摘要

Speech processing is needed whenever speech is to be compressed, synthesised or recognised by the means of electrical equipment. Different types of phones, multimedia equipment and interfaces to various electronic devices, all require digital speech processing. As an example, a GSM phone applies speech processing in its RPE-LTP encoder/decoder (ETSI, 1997). In this coder, 20 ms of speech is first analysed in the short-term prediction (STP) part, and second in the long-term prediction (LTP) part. Finally, speech compression is achieved in the RPE encoding part, where only 1/3 of the encoded samples are selected to be transmitted.This thesis presents modifications for one of the most widely applied techniques in digital speech processing, namely linear prediction (LP). During recent decades linear prediction has played an important role in telecommunications and other areas related to speech compression and recognition. In linear prediction sample s(n) is predicted from its p previous samples by forming a linear combination of the p previous samples and by minimising the prediction error. This procedure in the time domain corresponds to modelling the spectral envelope of the speech spectrum in the frequency domain. The accuracy of the spectral envelope to the speech spectrum is strongly dependent on the order of the resulting all-pole filter. This, in turn, is usually related to the number of parameters required to define the model, and hence to be transmitted.Our study presents new predictive methods, which are modified from conventional linear prediction by taking the previous samples for linear combination differently. This algorithmic development aims at new all-pole techniques, which could present speech spectra with fewer parameters.
机译:每当要通过电气设备压缩,合成或识别语音时,都需要进行语音处理。不同类型的电话,多媒体设备以及与各种电子设备的接口都需要数字语音处理。例如,GSM电话在其RPE-LTP编码器/解码器中应用语音处理(ETSI,1997)。在此编码器中,首先在短期预测(STP)部分中分析20 ms语音,然后在长期预测(LTP)部分中分析20 ms。最后,在RPE编码部分实现了语音压缩,其中只有1/3的编码样本被选择进行传输。本文提出了对数字语音处理中应用最广泛的技术之一的修改,即线性预测(LP)。 。在最近的几十年中,线性预测在电信及其他与语音压缩和识别有关的领域中发挥了重要作用。在线性预测中,通过形成p个先前样本的线性组合并最小化预测误差,从其p个先前样本中预测s(n)。时域中的此过程对应于在频域中对语音频谱的频谱包络建模。频谱包络对语音频谱的准确性在很大程度上取决于所得全极点滤波器的顺序。反过来,这通常与定义模型所需的参数数量有关,因此需要传输。我们的研究提出了新的预测方法,该方法是通过将先前的线性组合样本进行不同的处理而从常规线性预测中修改而来的。该算法的开发针对于新的全极点技术,该技术可以用较少的参数呈现语音频谱。

著录项

  • 作者

    Varho Susanna;

  • 作者单位
  • 年度 2001
  • 总页数
  • 原文格式 PDF
  • 正文语种 en
  • 中图分类

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号