首页> 外文会议>2012 International Conference on Signal Processing and Communications >Ultra low bit-rate speech coding: An overview and recent results
【24h】

Ultra low bit-rate speech coding: An overview and recent results

机译:超低比特率语音编码:概述和最新结果

获取原文
获取原文并翻译 | 示例

摘要

In narrow-band speech coding, specifically in the low and ultra low bit-rate ranges, a series of efficient quantization of the LP parameters using fixed-length as well as variable-length segment quantization (VLSQ) have resulted in a progressive reduction in the bit-rate from the 2400 bits/sec baseline of the LPC-10 coder down to 300 bits/sec and less. The VLSQ framework forms a generic basis of a class of segment vocoders within which various types of segments/units and unit-modeling have been explored, such as phones (in the phonetic vocoder), automatically derived units (phones and diphones), R/D optimal linear prediction, HMM based recognition-synthesis and unit-selection based paradigms. Recently, set within the original unit-selection framework, we proposed a joint spectral-residual quantization scheme which obviates the need for transmitting any side information about the residual of the input speech, offering up to 2dB spectral distortion at 250 bits/sec. In this paper, in order to realize better rate-distortion performance, we propose joint spectral-residual quantization in an optimal unit-selection framework based on a modified one-pass dynamic programming (DP) algorithm.
机译:在窄带语音编码中,特别是在低和超低比特率范围内,使用固定长度和可变长度段量化(VLSQ)对LP参数进行的一系列有效量化已导致逐渐降低了从LPC-10编码器的2400比特/秒的基线到300比特/秒甚至更低的比特率。 VLSQ框架是一类分段声码器的通用基础,在其中探讨了各种类型的分段/单元和单元建模,例如电话(在语音声码器中),自动派生的单元(电话和双音器),R /最佳线性预测,基于HMM的识别合成和基于单元选择的范式。最近,在原始的单元选择框架内,我们提出了一种联合频谱残差量化方案,该方案消除了传输有关输入语音残差的任何辅助信息的需要,在250位/秒时提供高达2dB的频谱失真。在本文中,为了实现更好的速率失真性能,我们提出了一种基于改进的单程动态规划(DP)算法的最优单元选择框架中的联合频谱残留量化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号