Voicing-specific LPC quantization for variable-rate speech coding

Hagen R.; Paksoy E.

Abstract

Phonetic classification of speech frames allows distinctive quantization and bit allocation schemes suited to the particular class. Separate quantization of the linear predictive coding (LPC) parameters for voiced and unvoiced speech frames is shown to offer useful gains for representing the synthesis filter commonly used in code-excited linear prediction (CELP) and other coders. Subjective test results are reported that determine the required bit rate and accuracy in the two classes of voiced and unvoiced LPC spectra for CELP coding with phonetic classification. It was found, in this context, that unvoiced spectra need 9 b/frame or more whereas voiced spectra need 25 b/frame or more with the quantization schemes used. New spectral distortion criteria needed to assure transparent LPC spectral quantization for each voicing class in CELP coders are presented. Similar subjective test results for speech synthesized from the true residual signal are also presented, leading to some interesting observations on the role of the analysis-by-synthesis structure of CELP. Objective performance assessments based on the spectral distortion measure are also presented. The theoretical distortion-rate function for the spectral distortion measure is estimated for voiced and unvoiced LPC parameters and compared with experimental results obtained with unstructured vector quantization (VQ). These results show a saving of at least 2 b/frame for unvoiced spectra compared to voiced spectra to achieve the same spectral distortion performance.
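The abstract refers to a spectral distortion measure and to per-class bit budgets without defining either computation. The sketch below is not taken from the paper: it assumes the definition of spectral distortion commonly used in LPC quantization work, namely the root-mean-square difference in dB between the log power spectra of the unquantized and quantized all-pole filters 1/A(z). The function names, the frequency grid size, and the `voiced_fraction` parameter are hypothetical and chosen only for illustration; the 25 and 9 b/frame defaults are the figures quoted in the abstract.

```python
import cmath
import math


def lpc_log_spectrum_db(a, n_points=256):
    """Log power spectrum (dB) of 1/A(z), sampled at n_points frequencies on [0, pi).

    `a` is the LPC polynomial [1, a1, ..., ap]; the filter is assumed stable,
    so A(e^{jw}) never vanishes on the unit circle.
    """
    spectrum = []
    for k in range(n_points):
        w = math.pi * k / n_points
        a_of_w = sum(c * cmath.exp(-1j * w * i) for i, c in enumerate(a))
        # Power spectrum is 1/|A|^2, i.e. -20*log10|A| in dB.
        spectrum.append(-20.0 * math.log10(abs(a_of_w)))
    return spectrum


def spectral_distortion_db(a_ref, a_quant, n_points=256):
    """RMS log-spectral difference (dB) between two LPC filters (assumed definition)."""
    s_ref = lpc_log_spectrum_db(a_ref, n_points)
    s_quant = lpc_log_spectrum_db(a_quant, n_points)
    mean_sq = sum((x - y) ** 2 for x, y in zip(s_ref, s_quant)) / n_points
    return math.sqrt(mean_sq)


def average_lpc_bits(voiced_fraction, voiced_bits=25, unvoiced_bits=9):
    """Average LPC bits per frame for a hypothetical share of voiced frames."""
    return voiced_fraction * voiced_bits + (1.0 - voiced_fraction) * unvoiced_bits


if __name__ == "__main__":
    a_ref = [1.0, -0.9]      # illustrative first-order filter, not real speech data
    a_quant = [1.0, -0.85]   # a perturbed ("quantized") version of it
    print(spectral_distortion_db(a_ref, a_quant))
    print(average_lpc_bits(0.5))
```

In a voicing-specific scheme, a measure of this kind would be averaged separately over voiced and unvoiced frames, so that each class can be compared against its own transparency criterion.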

Bibliographic details

Source: IEEE Transactions on Speech and Audio Processing, 1999, No. 5, pp. 485-494 (10 pages)
Authors: Hagen R.; Paksoy E.
Original format: PDF
Language: English
CLC classification: Electroacoustics and speech signal processing

Similar literature

1. Voicing-specific LPC quantization for variable-rate speech coding [J]. Hagen R., Paksoy E., Gersho A. IEEE Transactions on Speech and Audio Processing. 1999, No. 5
2. Variable-rate finite-state vector quantization and applications to speech and image coding [J]. Hussain Y., Farvardin N. IEEE Transactions on Speech and Audio Processing. 1993, No. 1
3. Single and double frame coding of speech LPC parameters using a lattice-based quantization scheme [J]. Lahouti F., Fazel A.R., Safavi-Naeini A.H. IEEE Transactions on Audio, Speech, and Language Processing. 2006, No. 5
4. A variable-rate harmonic speech coder with efficient spectral quantization [C]. Yu E.W.M., Cheung-Fat Chan. 1999
5. Speech coding using Linear Predictive Coding (LPC10) [D]. Naik, Palak H. 2007
6. A scalable speech coding scheme using compressive sensing and orthogonal mapping based vector quantization [O]. M.S. Arun Sankar, P.S. Sathidevi. 2019
7. Variable-Rate Finite-State Vector Quantization and Applications to Speech and Image Coding [O]. Hussain, Yunus; Farvardin, Nariman. 1991
8. Phase-Only Version of the LPC (Linear Predictive Coding) Residual in Speech Coding [R]. Milios, E. E.; Oppenheim, A. V. 1983
