...
首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Companded quantization of speech MDCT coefficients
【24h】

Companded quantization of speech MDCT coefficients

机译:语音MDCT系数的压缩扩展

获取原文
获取原文并翻译 | 示例
           

摘要

Here, we propose speech-coding procedures achieving high subjective quality, avoiding speech-specific processing and interframe exploitation. Thus, the scheme is tractable for packet-based voice communication, and has the capability of coding generic audio. The architecture is based on an modified discrete cosine transform (MDCT) representation of the signal, and combines efficient vector quantization (VQ) techniques with psychoacoustic principles. Weighted quantization of MDCT coefficients is performed, using a codebook based on a statistical model of the multidimensional MDCT pdf. The weighting and the codebook are adapted for each frame to account for masking thresholds given by a psychoacoustic analysis. Actual quantization is performed using lattices, thereby, achieving close to rate independent complexity. The result is a coding scheme operational at a range of rates. Here, a particular instance at 16 kbits/s, using a sampling frequency of 8 kHz, is shown to perform better than an LD-CELP operating at the same rate, even though no interframe memory is exploited.
机译:在这里,我们提出了实现高主观质量的语音编码程序,避免了特定于语音的处理和帧间开发。因此,该方案对于基于分组的语音通信来说是易于处理的,并且具有编码通用音频的能力。该架构基于信号的改进的离散余弦变换(MDCT)表示,并将有效的矢量量化(VQ)技术与心理声学原理相结合。使用基于多维MDCT pdf统计模型的密码本执行​​MDCT系数的加权量化。加权和码本适用于每个帧,以说明心理声学分析给出的掩蔽阈值。使用晶格执行实际的量化,从而实现接近速率无关的复杂性。结果是以一定速率范围操作的编码方案。在此,即使未使用帧间存储器,使用8 kHz采样频率的16 kbits / s的特定实例也表现出比以相同速率运行的LD-CELP更好的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号