Improved pitch modelling for low bit-rate speech coders.

Abstract

During the last several years, there has been a dramatic growth of digital services, such as digital wireless and wireline communications, satellite communications and digital voice storage systems. Such services require the use of high-quality low bit-rate coders to efficiently code the speech signal before transmission or storage. The majority of such coders employ algorithms that are based on Code-Excited Linear Prediction (CELP).

The goal of this thesis is to improve the quality of CELP coded speech, while keeping the basic coding format intact. The quality improvement is focused on voiced speech segments. A Pitch Pulse Averaging (PPA) algorithm has been developed to enhance the periodicity of such segments, where during steady-state voicing the pitch pulse waveforms in the excitation signal evolve slowly in time. The PPA algorithm extracts a number of such pitch pulse waveforms from the past excitation, aligns them, and then averages them to produce a new pitch pulse waveform with reduced noise.

The PPA algorithm has been simulated and tested on a floating-point C simulation of the G.729 8 kbps CS-ACELP coder. Objective tests verified that the algorithm contributes most during steady-state voiced speech. Thus a simple voicing decision mechanism has been developed to deactivate the algorithm during unvoiced segments and voicing onsets of speech. Results verified that the algorithm has generally improved the periodicity of voiced segments by reducing the average of the weighted mean-squared error.

While we were able to demonstrate improvements in objective measures, informal listening tests indicate that the already high perceptual quality of G.729 is generally not audibly altered. Nonetheless, the technique may be useful for improving the quality at lower rates, particularly for next-generation low bit-rate coders operating near 4 kbps.
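The extract, align, and average steps described in the abstract can be illustrated with a minimal sketch. This is not the thesis implementation: the function names, the fixed pitch lag, the default of three pulses, the circular-shift alignment by maximum correlation, and the search range `max_shift` are all assumptions made for illustration.

```python
import math


def extract_pulses(excitation, lag, count):
    """Cut the last `count` pitch-length segments from the past excitation."""
    if lag <= 0 or count <= 0 or len(excitation) < lag * count:
        raise ValueError("need at least lag * count past samples")
    end = len(excitation)
    # Most recent segment first.
    return [excitation[end - (k + 1) * lag: end - k * lag] for k in range(count)]


def circular_shift(segment, shift):
    """Rotate a segment by `shift` samples (positive delays the waveform)."""
    n = len(segment)
    return [segment[(i - shift) % n] for i in range(n)]


def best_shift(reference, segment, max_shift):
    """Return the circular shift of `segment` that best correlates with `reference`."""
    best, best_score = 0, -math.inf
    for shift in range(-max_shift, max_shift + 1):
        shifted = circular_shift(segment, shift)
        score = sum(r * s for r, s in zip(reference, shifted))
        if score > best_score:
            best, best_score = shift, score
    return best


def average_pitch_pulses(excitation, lag, count=3, max_shift=2):
    """Hypothetical PPA-style step: extract, align, and average pitch pulses."""
    pulses = extract_pulses(excitation, lag, count)
    reference = pulses[0]  # align older pulses to the most recent one
    aligned = [reference]
    for pulse in pulses[1:]:
        shift = best_shift(reference, pulse, max_shift)
        aligned.append(circular_shift(pulse, shift))
    # Sample-by-sample mean across the aligned pulses.
    return [sum(column) / count for column in zip(*aligned)]


if __name__ == "__main__":
    pulse = [0, 0, 5, 1, 0, 0, 0, 0]
    # Oldest period is displaced by one sample; the two newest are in place.
    past = circular_shift(pulse, 1) + pulse + pulse
    print(average_pitch_pulses(past, lag=8))
```

In this toy input the oldest pulse is displaced by one sample, so the alignment step has to move it back before averaging for the pulse shape to be preserved. A real coder would also have to handle a time-varying pitch lag and the voicing decision mentioned in the abstract, both of which this sketch leaves out.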


Bibliographic record

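Author: Papacostantinou, Costantinos
Author affiliation: McGill University (Canada)
Degree-granting institution: McGill University (Canada)
Subject: Engineering, Electronics and Electrical
Degree: M.Eng.
Year: 1997
Pages: 84 p.
Total pages: 84
Original format: PDF
Language: English
CLC classification: Radio electronics, telecommunications technology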

Similar literature


1. Improving security of quantization-index-modulation steganography in low bit-rate speech streams [J]. Hui Tian, Jin Liu, Songbin Li. Multimedia Systems. 2014, No. 2

2. Improved consonant-vowel recognition for low bit-rate coded speech [J]. Anil Kumar Vuppala, K. Sreenivasa Rao, Saswat Chakrabarti. International Journal of Adaptive Control and Signal Processing. 2012, No. 4

3. Low bit-rate speech coding based on an improved sinusoidal model [J]. Sassan Ahmadi, Andreas S. Spanias. Speech Communication. 2001, No. 4

4. Pitch quantization in low bit-rate speech coding [C]. Eriksson, T., Hong-Goo Kang. 1999

5. Objective speech intelligibility assessment using speech recognition and bigram statistics with application to low bit-rate codec evaluation [D]. Teng, Yan. 2006

6. PNAS Plus: Piano training enhances the neural processing of pitch and improves speech perception in Mandarin-speaking children [O]. Yun Nan, Li Liu, Eveline Geiser, 2018

7. Universal steganography model for low bit-rate speech codec [O]. Tang Shanyu, Chen Qing, Zhang Wei, 2016
