首页> 外国专利> Tonal analysis for perceptual audio coding using a compressed spectral representation

Tonal analysis for perceptual audio coding using a compressed spectral representation

机译:使用压缩频谱表示的感知音频编码的音调分析

摘要

The present invention provides an apparatus, method and tangible medium storing instructions for determining tonality of an input audio signal, for selection of corresponding masked thresholds for use in perceptual audio coding. In the various embodiments, the input audio signal is sampled and transformed using a compressed spectral operation to form a compressed spectral representation, such as a cepstral representation. A peak magnitude and an average magnitude of the compressed spectral representation are determined. Depending upon the ratio of peak-to-average magnitudes, a masked threshold is selected having a corresponding degree of tonality, and is used to determine a plurality of quantization levels and a plurality of bit allocations to perceptually encode the input audio signal with a distortion spectrum beneath a level of just noticeable distortion (JND). The invention also includes other methods and variations for selecting substantially tone-like or substantially noise-like masked thresholds for perceptual encoding of the input audio signal.
机译:本发明提供一种设备,方法和有形介质,该设备,方法和有形介质存储用于确定输入音频信号的音调,用于选择用于感知音频编码的相应掩蔽阈值的指令。在各个实施例中,使用压缩频谱操作对输入音频信号进行采样和变换,以形成压缩频谱表示,例如倒频谱表示。确定压缩频谱表示的峰值大小和平均大小。取决于峰均值的比率,选择具有相应音调程度的掩蔽阈值,并将其用于确定多个量化级别和多个位分配,以感知地编码失真的输入音频信号低于明显失真(JND)的频谱。本发明还包括用于选择用于输入音频信号的感知编码的基本上像音调或基本上像噪声的掩蔽阈值的其他方法和变形。

著录项

  • 公开/公告号US2004181393A1

    专利类型

  • 公开/公告日2004-09-16

    原文格式PDF

  • 申请/专利权人 AGERE SYSTEMS INC.;

    申请/专利号US20030389000

  • 发明设计人 FRANK BAUMGARTE;

    申请日2003-03-14

  • 分类号G10L19/00;

  • 国家 US

  • 入库时间 2022-08-21 23:22:23

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号