VERY LOW BIT RATE (VLBR) SPEECH CODING AROUND 500 BITS/SEC

Abstract

New approaches to Very Low Bit Rate (VLBR) speech coding, based on speech recognition and speech synthesis technologies, have recently been proposed. Continuing the work described in [8], this paper presents a complete encoding scheme operating at around 500 bits/sec. The proposed solution relies on automatic recognition of elementary acoustic units using HMM modelling. An unsupervised training phase builds both the HMMs and the codebook of synthesis units. The decoded speech is then obtained by concatenating the corresponding synthesis units, using an HNM-like decomposition of speech. A new unit selection process that integrates prosody constraints is proposed. With this approach, the size of the synthesis codebook is independent of the targeted bit rate. The paper gives a complete description of the unit selection process and the associated prosody modelling, together with the quantisation scheme for the full set of encoded parameters.
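The claim that the synthesis codebook size does not depend on the bit rate can be illustrated with a rough bit budget. The sketch below is only one possible reading of the abstract, not the paper's actual scheme: it assumes that each recognised unit is transmitted as a class index plus a fixed number of quantised prosody bits, and that the choice of a particular synthesis unit is not transmitted. The function name `bit_rate` and all numeric values are hypothetical, chosen only so that the total lands near the 500 bits/sec target.

```python
import math


def bit_rate(units_per_second, num_unit_classes, prosody_bits_per_unit):
    """Bits/sec when each unit sends a class index plus quantised prosody.

    Assumption (not from the paper): the number of synthesis units stored
    per class does not appear here, because the selected synthesis unit
    is not transmitted in this simplified model.
    """
    # Bits needed to index one of the recognised unit classes.
    index_bits = math.ceil(math.log2(num_unit_classes))
    return units_per_second * (index_bits + prosody_bits_per_unit)


# Hypothetical figures, not taken from the paper.
rate = bit_rate(units_per_second=20, num_unit_classes=64, prosody_bits_per_unit=19)
print(rate)  # 20 * (6 + 19) = 500
```

In this simplified model, enlarging the synthesis codebook (more stored examples per unit class) leaves the result unchanged, since only the number of classes, the unit rate, and the prosody bits enter the formula.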