
Scalable speech and audio encoding using combinatorial encoding of MDCT spectrum


Abstract

A scalable speech and audio codec is provided that implements combinatorial spectrum encoding. A residual signal is obtained from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is the difference between an original audio signal and a reconstructed version of that signal. The residual signal is transformed at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines. The spectral lines of the transform spectrum are then encoded using a combinatorial position coding technique. This technique generates a lexicographical index for a selected subset of spectral lines, where each lexicographical index identifies one of the possible binary strings that represent the positions of the selected subset of spectral lines. The lexicographical index represents the non-zero spectral lines of a binary string in fewer bits than the length of that string.
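The position-coding idea in the abstract can be illustrated with a minimal sketch of enumerative (combinatorial) coding. This is not taken from the patent text: the function names `rank_positions`, `unrank_positions`, and `index_bits` are hypothetical, and the sketch assumes that the decoder already knows both the string length `n` and the number of non-zero lines `k`. A binary string of length `n` with `k` ones is mapped to its rank among all such strings in lexicographic order, so the index needs only about log2 C(n, k) bits.

```python
from math import comb

def rank_positions(bits):
    """Lexicographic rank of `bits` among all strings with the same length and number of ones."""
    n = len(bits)
    remaining = sum(bits)          # ones still to be placed
    index = 0
    for i, b in enumerate(bits):
        if b:
            # Every string with a 0 here (and the same prefix) sorts earlier.
            index += comb(n - i - 1, remaining)
            remaining -= 1
    return index

def unrank_positions(index, n, k):
    """Inverse of rank_positions: rebuild the length-n string with k ones."""
    bits = []
    remaining = k
    for i in range(n):
        zeros_first = comb(n - i - 1, remaining)  # strings that put a 0 at position i
        if remaining > 0 and index >= zeros_first:
            bits.append(1)
            index -= zeros_first
            remaining -= 1
        else:
            bits.append(0)
    return bits

def index_bits(n, k):
    """Bits needed to store any index in the range [0, C(n, k))."""
    return (comb(n, k) - 1).bit_length()

if __name__ == "__main__":
    spectrum_mask = [0, 1, 0, 0, 1, 0, 0, 0, 0, 0, 1, 0, 0, 0, 0, 0]  # 3 non-zero lines of 16
    idx = rank_positions(spectrum_mask)
    assert unrank_positions(idx, len(spectrum_mask), sum(spectrum_mask)) == spectrum_mask
    print(idx, index_bits(16, 3))
```

For 16 spectral lines with 3 non-zero positions there are C(16, 3) = 560 possible strings, so the index fits in 10 bits rather than 16. The saving grows as the selected subset becomes sparser relative to the band length.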

Bibliographic details

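  • Publication number: AU2008316860B2

  • Publication date: 2011-06-16

  • Original format: PDF

  • Applicant/patentee: QUALCOMM INCORPORATED

  • Application number: AU20080316860

  • Inventors: YURIY REZNIK; PENGJUN HUANG

  • Filing date: 2008-10-22

  • Classification: G10L19/14; G10L19/02

  • Country: AU

  • Date added to database: 2022-08-21 18:01:14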
