Ultra low bit-rate speech coding: An overview and recent results


Abstract

In narrow-band speech coding, specifically in the low and ultra low bit-rate ranges, a series of efficient schemes for quantizing the LP parameters, using fixed-length as well as variable-length segment quantization (VLSQ), has progressively reduced the bit-rate from the 2400 bits/sec baseline of the LPC-10 coder down to 300 bits/sec and below. The VLSQ framework forms a generic basis for a class of segment vocoders within which various types of segments/units and unit models have been explored, such as phones (in the phonetic vocoder), automatically derived units (phones and diphones), R/D-optimal linear prediction, HMM-based recognition-synthesis, and unit-selection-based paradigms. Recently, within the original unit-selection framework, we proposed a joint spectral-residual quantization scheme that obviates the need to transmit any side information about the residual of the input speech, with spectral distortion of up to 2 dB at 250 bits/sec. In this paper, to achieve better rate-distortion performance, we propose joint spectral-residual quantization in an optimal unit-selection framework based on a modified one-pass dynamic programming (DP) algorithm.
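
The general idea of variable-length segment quantization can be illustrated with a small dynamic-programming sketch. This is not the modified one-pass DP algorithm of the paper, whose details the abstract does not give. It is a minimal illustration under simplifying assumptions: each codebook unit is a single centroid vector (real segment vocoders use multi-frame unit templates), distortion is squared Euclidean distance, and `segment_penalty` is a hypothetical Lagrangian term standing in for the bits spent per segment. The names `segment_quantize`, `sq_dist`, `max_len`, and `segment_penalty` are all illustrative.

```python
from typing import List, Sequence, Tuple

Frame = Sequence[float]


def sq_dist(a: Frame, b: Frame) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def segment_quantize(
    frames: List[Frame],
    codebook: List[Frame],
    max_len: int,
    segment_penalty: float,
) -> Tuple[float, List[Tuple[int, int, int]]]:
    """Jointly pick segment boundaries and one codebook unit per segment.

    Minimizes (total distortion + segment_penalty * number_of_segments).
    Returns (total_cost, [(start, end, unit_index), ...]) with end exclusive.
    """
    n_frames = len(frames)
    n_units = len(codebook)
    inf = float("inf")

    # Distortion of every frame against every unit (assumed single centroid).
    frame_cost = [[sq_dist(f, c) for c in codebook] for f in frames]

    best = [inf] * (n_frames + 1)   # best[t]: minimum cost of coding frames[0:t]
    best[0] = 0.0
    back = [None] * (n_frames + 1)  # back[t]: (start, unit) of the last segment

    for end in range(1, n_frames + 1):
        # run[k]: distortion of segment [start, end) coded with unit k,
        # accumulated as the segment grows backwards one frame at a time.
        run = [0.0] * n_units
        for length in range(1, min(max_len, end) + 1):
            start = end - length
            for k in range(n_units):
                run[k] += frame_cost[start][k]
                cand = best[start] + run[k] + segment_penalty
                if cand < best[end]:
                    best[end] = cand
                    back[end] = (start, k)

    # Trace the optimal segmentation back from the last frame.
    segments = []
    t = n_frames
    while t > 0:
        start, k = back[t]
        segments.append((start, t, k))
        t = start
    segments.reverse()
    return best[n_frames], segments


if __name__ == "__main__":
    toy_frames = [[0.0], [0.1], [0.0], [1.0], [0.9], [1.0]]
    toy_codebook = [[0.0], [1.0]]
    cost, segs = segment_quantize(toy_frames, toy_codebook, 4, 0.5)
    print(cost, segs)
```

A larger `segment_penalty` favors fewer, longer segments (lower rate, higher distortion), while a value near zero lets the segmentation follow the frames closely. Only the unit index and the duration of each segment would need to be transmitted in such a scheme.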


Bibliographic details

Source: 2012 International Conference on Signal Processing and Communications, 2012, pp. 1-5 (5 pages)
Conference location: Bangalore (IN)
Author: Ramasubramanian V.
Original format: PDF
Language: English
Classification (CLC): Signal processing

