AUDIO SIGNAL COMPRESSION METHOD, AUDIO SIGNAL COMPRESSION APPARATUS, SPEECH SIGNAL COMPRESSION METHOD, SPEECH SIGNAL COMPRESSION APPARATUS, SPEECH RECOGNITION METHOD, AND SPEECH RECOGNITION APPARATUS

Abstract

An audio signal compression apparatus for compressively coding an input audio signal comprises a time-to-frequency transformation unit for transforming the input audio signal to a frequency domain signal; a spectrum envelope calculation unit for calculating a spectrum envelope having different resolutions for different frequencies, from the input audio signal, using a weighting function on frequency based on human auditory characteristics; a normalization unit for normalizing the frequency domain signal using the spectrum envelope to obtain a residual signal; a power normalization unit for normalizing the residual signal by the power; an auditory weighting calculation unit for calculating weighting coefficients on frequency, based on the spectrum of the input audio signal and human auditory characteristics; and a multi-stage quantization means having plural stages of vector quantizers connected in series, to which the normalized residual signal is input, and at least one of the vector quantizers quantizing the residual signal using the weighting coefficients. Therefore, a low frequency band, which is auditively important, can be analyzed with a higher frequency resolution as compared with a high frequency band, whereby efficient signal compression utilizing human auditory characteristics is realized.
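The last stages of the chain described above (envelope normalization, power normalization, and series-connected vector quantizers that use auditory weights) can be illustrated with a minimal sketch. It is not the patented implementation: the function names, the RMS definition of power, the weighted squared-error distance, and the toy codebooks are all assumptions made for illustration, and the abstract does not say how the codebooks are trained or how the weights are derived.

```python
from typing import List, Sequence, Tuple

Vector = Sequence[float]


def normalize_by_envelope(spectrum: Vector, envelope: Vector) -> List[float]:
    """Divide each frequency-domain coefficient by the spectrum envelope.

    The result plays the role of the residual signal (assumed element-wise
    division; the envelope is assumed to be strictly positive).
    """
    return [s / e for s, e in zip(spectrum, envelope)]


def power_normalize(residual: Vector) -> Tuple[List[float], float]:
    """Scale the residual to unit RMS power and return (normalized, gain).

    Using RMS as the "power" is an assumption of this sketch.
    """
    rms = (sum(v * v for v in residual) / len(residual)) ** 0.5
    gain = rms if rms > 0 else 1.0
    return [v / gain for v in residual], gain


def weighted_nearest(x: Vector, codebook: Sequence[Vector], weights: Vector) -> int:
    """Index of the codeword with the smallest weighted squared error."""
    best_index, best_cost = 0, float("inf")
    for i, code in enumerate(codebook):
        cost = sum(w * (a - b) ** 2 for w, a, b in zip(weights, x, code))
        if cost < best_cost:
            best_index, best_cost = i, cost
    return best_index


def multistage_quantize(
    x: Vector, codebooks: Sequence[Sequence[Vector]], weights: Vector
) -> Tuple[List[int], List[float]]:
    """Quantize x with vector quantizers connected in series.

    Each stage quantizes the error left by the previous stage.
    Returns the chosen index per stage and the final quantization error.
    """
    indices: List[int] = []
    error = list(x)
    for codebook in codebooks:
        idx = weighted_nearest(error, codebook, weights)
        indices.append(idx)
        error = [e - c for e, c in zip(error, codebook[idx])]
    return indices, error


def multistage_decode(
    indices: Sequence[int], codebooks: Sequence[Sequence[Vector]]
) -> List[float]:
    """Reconstruct the vector by summing the selected codeword of every stage."""
    out = [0.0] * len(codebooks[0][0])
    for idx, codebook in zip(indices, codebooks):
        out = [o + c for o, c in zip(out, codebook[idx])]
    return out


if __name__ == "__main__":
    # Toy two-stage codebooks (hypothetical values, not from the patent).
    stage1 = [[1.0, 1.0], [-1.0, -1.0]]
    stage2 = [[0.25, 0.0], [0.0, 0.25], [0.0, 0.0]]
    weights = [2.0, 1.0]  # larger weight = auditorily more important bin
    idx, err = multistage_quantize([1.25, 1.0], [stage1, stage2], weights)
    print(idx, err, multistage_decode(idx, [stage1, stage2]))
```

In this sketch the weights only change which codeword each stage selects; the decoder needs just the indices (plus the gain and envelope, which a real codec would transmit separately).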
