Companded quantization of speech MDCT coefficients

Norden F.; Hedelin P.

首页> 外文期刊>IEEE Transactions on Speech and Audio Proceessing >Companded quantization of speech MDCT coefficients

【24h】

Companded quantization of speech MDCT coefficients

机译：语音MDCT系数的压缩扩展

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Here, we propose speech-coding procedures achieving high subjective quality, avoiding speech-specific processing and interframe exploitation. Thus, the scheme is tractable for packet-based voice communication, and has the capability of coding generic audio. The architecture is based on an modified discrete cosine transform (MDCT) representation of the signal, and combines efficient vector quantization (VQ) techniques with psychoacoustic principles. Weighted quantization of MDCT coefficients is performed, using a codebook based on a statistical model of the multidimensional MDCT pdf. The weighting and the codebook are adapted for each frame to account for masking thresholds given by a psychoacoustic analysis. Actual quantization is performed using lattices, thereby, achieving close to rate independent complexity. The result is a coding scheme operational at a range of rates. Here, a particular instance at 16 kbits/s, using a sampling frequency of 8 kHz, is shown to perform better than an LD-CELP operating at the same rate, even though no interframe memory is exploited.

机译：在这里，我们提出了实现高主观质量的语音编码程序，避免了特定于语音的处理和帧间开发。因此，该方案对于基于分组的语音通信来说是易于处理的，并且具有编码通用音频的能力。该架构基于信号的改进的离散余弦变换（MDCT）表示，并将有效的矢量量化（VQ）技术与心理声学原理相结合。使用基于多维MDCT pdf统计模型的密码本执行MDCT系数的加权量化。加权和码本适用于每个帧，以说明心理声学分析给出的掩蔽阈值。使用晶格执行实际的量化，从而实现接近速率无关的复杂性。结果是以一定速率范围操作的编码方案。在此，即使未使用帧间存储器，使用8 kHz采样频率的16 kbits / s的特定实例也表现出比以相同速率运行的LD-CELP更好的性能。

著录项

来源
《IEEE Transactions on Speech and Audio Proceessing》 |2005年第2期|p.163-173|共11页
作者
Norden F.; Hedelin P.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类电声技术和语音信号处理;
关键词
audio coding; computational complexity; discrete cosine transforms; speech coding; statistical analysis; vector quantisation; voice communication; companded quantization; generic audio coding; interframe memory exploitation; modified discrete cosine transform; mult;

机译：音频编码;计算复杂度;离散余弦变换;语音编码;统计分析;矢量量化;语音通信;组合量化;通用音频编码;帧间存储器开发;改进的离散余弦变换;mult;

相似文献

外文文献
中文文献
专利

1. Superwideband Bandwidth Extension Using Normalized MDCT Coefficients for Scalable Speech and Audio Coding [J] . Young Han Lee, Seung Ho Choi Advances in multimedia . 2013,第期

机译：使用归一化MDCT系数进行可扩展语音和音频编码的超宽带扩展
2. Superwideband Bandwidth Extension Using Normalized MDCT Coefficients for Scalable Speech and Audio Coding [J] . Young HanLee, Seung HoChoi Advances in multimedia . 2013,第1期

机译：使用归一化MDCT系数进行可扩展语音和音频编码的超宽带扩展
3. A Search Complexity Improvement of Vector Quantization to Immittance Spectral Frequency Coefficients in AMR-WB Speech Codec [J] . Bing-Jhih Yao, Cheng-Yu Yeh, Shaw-Hwa Hwang Symmetry . 2016,第10期

机译：向量量化对AMR-WB语音编解码器的阻抗谱频率系数的搜索复杂度的提高
4. A finite-state entropy-constrained vector quantizer for audio MDCT coefficients coding [C] . Jiang Sumxin, Yin Rendong, Liu Peilin 2012 International Conference on Audio, Language and Image Processing. . 2012

机译：音频MDCT系数编码的有限状态熵约束矢量量化器
5. Volume data compression using model-based quantization and three-dimensional zerotree encoding of wavelet coefficients. [D] . Pratt, Michael Anthony, Sr. 2003

机译：使用基于模型的量化和小波系数的三维零树编码进行体积数据压缩。
6. Quantization of collagen organization in the stroma with a new order coefficient [O] . James A. Germann, Eduardo Martinez-Enriquez, Susana Marcos 2018

机译：用新的阶数系数量化基质中的胶原组织
7. A Search Complexity Improvement of Vector Quantization to Immittance Spectral Frequency Coefficients in AMR-WB Speech Codec [O] . Bing-Jhih Yao, Cheng-Yu Yeh, Shaw-Hwa Hwang 2016

机译：amR-WB语音编解码器中导纳谱频率系数矢量量化的搜索复杂度改进

Companded quantization of speech MDCT coefficients

摘要

著录项

相似文献

相关主题

期刊订阅