首页> 外国专利> SCALABLE SPEECH AND AUDIO ENCODING USING COMBINATORIAL ENCODING OF MDCT SPECTRUM

SCALABLE SPEECH AND AUDIO ENCODING USING COMBINATORIAL ENCODING OF MDCT SPECTRUM

机译:使用MDCT谱的组合编码进行可缩放的语音和音频编码

摘要

A scalable speech and audio codec is provided that implements combinatorial spectrum encoding. A residual signal is obtained from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal. The residual signal is transformed at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines. The transform spectrum spectral lines are transformed using a combinatorial position coding technique. The combinatorial position coding technique includes generating a lexicographical index for a selected subset of spectral lines, where each lexicographic index represents one of a plurality of possible binary strings representing the positions of the selected subset of spectral lines. The lexicographical index represents non-zero spectral lines in a binary string in fewer bits than the length of the binary string.
机译:提供了实现组合频谱编码的可伸缩语音和音频编解码器。从基于码激励线性预测(CELP)的编码层获得残差信号,其中残差信号是原始音频信号与原始音频信号的重构版本之间的差。残留信号在离散余弦变换(DCT)类型的变换层处进行变换,以获得具有多条谱线的对应变换谱。使用组合位置编码技术来变换变换频谱谱线。组合位置编码技术包括为谱线的选定子集生成词典索引,其中每个词典索引代表表示谱线的选定子集的位置的多个可能的二进制串之一。字典索引表示二进制字符串中非零频谱线的位数少于二进制字符串的长度。

著录项

  • 公开/公告号EP2255358B1

    专利类型

  • 公开/公告日2013-07-03

    原文格式PDF

  • 申请/专利权人 QUALCOMM INC;

    申请/专利号EP20080843220

  • 发明设计人 REZNIK YURIY;HUANG PENGJUN;

    申请日2008-10-22

  • 分类号G10L19/24;G10L19/03;

  • 国家 EP

  • 入库时间 2022-08-21 16:33:56

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号