首页> 外国专利> Selection of scalar quantization(SQ) and vector quantization (VQ) for speech coding

Selection of scalar quantization(SQ) and vector quantization (VQ) for speech coding

机译：语音编码的标量量化（SQ）和矢量量化（VQ）的选择

页面导航

摘要
著录项
相似文献

摘要

A multi-rate speech codec supports a plurality of encoding bit rate modes by adaptively selecting encoding bit rate modes to match communication channel restrictions. In higher bit rate encoding modes, an accurate representation of speech through CELP (code excited linear prediction) and other associated modeling parameters are generated for higher quality decoding and reproduction. For each bit rate mode selected, pluralities of fixed or innovation subcodebooks are selected for use in generating innovation vectors. The speech coder distinguishes various voice signals as a function of their voice content. For example, a Voice Activity Detection (VAD) algorithm selects an appropriate coding scheme depending on whether the speech signal comprises active or inactive speech. The encoder may consider varying characteristics of the speech signal including sharpness, a delay correlation, a zero-crossing rate, and a residual energy. In another embodiment of the present invention, code excited linear prediction is used for voice active signals whereas random excitation is used for voice inactive signals; the energy level and spectral content of the voice inactive signal may also be used for noise coding. The multi-rate speech codec may employ distributed detection and compensation processing the speech signal. For high quality perceptual speech reproduction, the speech codec may perform noise detection in both an encoder and decoder. The noise detection may be coordinated between the encoder and decoder. Similarly, noise compensation may be performed in a distributed manner among both the decoder and the encoder.

机译：多速率语音编解码器通过自适应地选择编码比特率模式以匹配通信信道限制来支持多种编码比特率模式。在更高的比特率编码模式中，将生成通过CELP（代码激励线性预测）和其他相关建模参数的语音的准确表示，以实现更高质量的解码和再现。对于选择的每种比特率模式，选择多个固定或创新子码本以用于生成创新矢量。语音编码器根据语音内容区分各种语音信号。例如，语音活动检测（VAD）算法根据语音信号包括有效语音还是无效语音来选择适当的编码方案。编码器可以考虑语音信号的变化特性，包括清晰度，延迟相关性，过零率和残余能量。在本发明的另一个实施例中，码激励线性预测被用于语音激活信号，而随机激励被用于语音非激活信号。语音无效信号的能量水平和频谱内容也可以用于噪声编码。多速率语音编解码器可以采用分布式检测和补偿处理语音信号。为了获得高质量的感知语音再现，语音编解码器可以在编码器和解码器中执行噪声检测。噪声检测可以在编码器和解码器之间协调。类似地，可以在解码器和编码器两者之间以分布式方式执行噪声补偿。

著录项

公开/公告号EP2085966A1

专利类型
公开/公告日2009-08-05

原文格式PDF
申请/专利权人 MINDSPEED TECHNOLOGIES INC.;
展开▼

申请/专利号EP20090152360
发明设计人 THYSSEN JES;SU HUAN-YU;GAO YANG;BENYASSINE ADIL;
展开▼

申请日1999-08-24
分类号G10L19/14;
国家 EP
入库时间 2022-08-21 19:15:07

相似文献

专利
外文文献
中文文献