Improved pitch modelling for low bit-rate speech coders.

Abstract

During the last several years, there has been a dramatic growth of digital services, such as digital wireless and wireline communications, satellite communications and digital voice storage systems. Such services require the use of high-quality low bit-rate coders to efficiently code the speech signal before transmission or storage. The majority of such coders employ algorithms that are based on Code-Excited Linear Prediction (CELP).

The goal of this thesis is to improve the quality of CELP coded speech, while keeping the basic coding format intact. The quality improvement is focused on voiced speech segments. A Pitch Pulse Averaging (PPA) algorithm has been developed to enhance the periodicity of such segments, where during steady state voicing the pitch pulse waveforms in the excitation signal evolve slowly in time. The PPA algorithm extracts a number of such pitch pulse waveforms from the past excitation, aligns them, and then averages them to produce a new pitch pulse waveform with reduced noise.

The PPA algorithm has been simulated and tested on a floating point C-simulation of the G.729 8 kbps CS-ACELP coder. Objective tests verified that the algorithm contributes most during steady state voiced speech. Thus a simple voicing decision mechanism has been developed to deactivate the algorithm during unvoiced segments and voicing onsets of speech. Results verified that the algorithm has generally improved the periodicity of voiced segments by reducing the average of the weighted mean-squared error.

While we were able to demonstrate improvements in objective measures, informal listening tests indicate that the already high perceptual quality of G.729 is generally not audibly altered. Nonetheless, the technique may be useful for improving the quality at lower rates, particularly for next generation low bit-rate coders operating near 4 kbps.
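The extract, align, and average steps of PPA described in the abstract can be illustrated with a minimal Python sketch. This is not the thesis implementation: the abstract does not say how pulses are located or aligned, so the sketch assumes a fixed integer pitch lag, cuts whole cycles from the end of the excitation buffer, and aligns each older cycle to the most recent one with a small circular-shift correlation search. The function names and the `count` and `max_shift` parameters are hypothetical.

```python
def extract_cycles(excitation, lag, count):
    """Cut the last `count` pitch cycles of `lag` samples each, most recent first."""
    if lag <= 0 or count <= 0 or lag * count > len(excitation):
        raise ValueError("excitation buffer too short for the requested cycles")
    end = len(excitation)
    return [list(excitation[end - (k + 1) * lag:end - k * lag]) for k in range(count)]


def circular_shift(cycle, shift):
    """Rotate a cycle right by `shift` samples (a negative shift rotates left)."""
    s = shift % len(cycle)
    return list(cycle) if s == 0 else cycle[-s:] + cycle[:-s]


def best_shift(reference, cycle, max_shift):
    """Return the shift in [-max_shift, max_shift] that maximizes correlation."""
    best, best_score = 0, float("-inf")
    for shift in range(-max_shift, max_shift + 1):
        candidate = circular_shift(cycle, shift)
        score = sum(r * c for r, c in zip(reference, candidate))
        if score > best_score:
            best, best_score = shift, score
    return best


def pitch_pulse_average(excitation, lag, count=3, max_shift=2):
    """Average `count` aligned pitch cycles taken from the past excitation.

    Assumed alignment rule (not from the thesis): each older cycle is rotated
    to best match the most recent cycle before averaging.
    """
    cycles = extract_cycles(excitation, lag, count)
    reference = cycles[0]
    aligned = [reference]
    for cycle in cycles[1:]:
        aligned.append(circular_shift(cycle, best_shift(reference, cycle, max_shift)))
    # Sample-by-sample mean across the aligned cycles.
    return [sum(samples) / count for samples in zip(*aligned)]


# Usage: three cycles of lag 8; the oldest pulse sits one sample early.
oldest = [0, 1, 0, 0, 0, 0, 0, 0]
middle = [0, 0, 1, 0, 0, 0, 0, 0]
recent = [0, 0, 1, 0, 0, 0, 0, 0]
averaged = pitch_pulse_average(oldest + middle + recent, lag=8)
```

In this toy input the oldest cycle is rotated by one sample before averaging, so the averaged pulse keeps its full height instead of being smeared over two positions. A coder such as G.729 uses fractional pitch lags and a time-varying pitch, so a realistic version would need interpolation and a per-cycle lag.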

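Bibliographic record

  • Author affiliation: McGill University (Canada)
  • Degree-granting institution: McGill University (Canada)
  • Subject: Engineering, Electronics and Electrical
  • Degree: M.Eng.
  • Year: 1997
  • Pages: 84 p.
  • Total pages: 84
  • Original format: PDF
  • Language of text: English
  • Chinese Library Classification: Radio electronics, telecommunications technology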