首页> 外国专利> Audio signal time-scale modification method using variable length synthesis and reduced cross-correlation computations

Audio signal time-scale modification method using variable length synthesis and reduced cross-correlation computations

机译:使用可变长度合成和减少互相关计算的音频信号时标修改方法

摘要

Disclosed is an audio signal time-scale modification which utilizes variable length synthesis for the improvement of output audio quality and reduced cross-correlation computations for the reduction of computation loads to a processor. An analysis window consisting of N+Kmax audio samples is selected from an input audio samples and is shifted by the predetermined interval along output audio samples to find optimal shift Km, which ensures best cross-correlation between Nov audio samples of the analysis window and last Nov audio samples of the output audio samples and a particular value of Nm at which a coefficient of correlation between them is larger than a reference value or is the maximum one among a plurality of coefficients of correlation calculated with varying the value of Nov. The audio samples involved in the calculation of cross-correlation are down-selected by the predetermined ratio from Nov audio samples of the analysis window and last Nov audio samples of the output audio samples, respectively. The analysis window may also be shifted by the plurality of audio samples per one shift. The audio samples ranged region (Km+Nov−Nm)th sample in the analysis window is determined as an add frame. The existing last Nm audio samples of the output audio samples are replaced with new Nm audio samples obtained by weighting and adding the overlapped parts, i.e., the first Nm audio samples of the add frame and the last Nm audio samples of the output audio samples, while remaining part of the add frame is simply appended to the tail of the new Nm audio samples in the output audio samples.
机译:公开了一种音频信号时标修改,其利用可变长度合成来改善输出音频质量,并利用减少的互相关计算来减少处理器的计算负荷。从输入音频样本中选择一个包含N + Kmax个音频样本的分析窗口,并沿着输出音频样本将其偏移预定间隔,以找到最佳偏移Km,从而确保分析窗口的Nov音频样本与最后一个音频样本之间的最佳互相关输出音频样本的Nov音频样本以及Nm的特定值,在该特定值处,它们之间的相关系数大于参考值,或者是通过更改Nov值而计算出的多个相关系数中的最大值。分别从分析窗口的11月音频样本和输出音频样本的最后11月音频样本中按预定比例向下选择涉及互相关计算的样本。分析窗口还可每移位一班,移动多个音频样本。将分析窗口中的音频样本范围区域(Km + Nov-Nm) 样本确定为加帧。输出音频样本的现有最后Nm个音频样本被替换为新的Nm个音频样本,这些新的Nm音频样本通过对重叠部分进行加权和相加而获得,即加法帧的前Nm个音频样本和输出音频样本的最后Nm个音频样本,而添加帧的其余部分仅附加到输出音频样本中新的Nm个音频样本的尾部。

著录项

  • 公开/公告号US2005273321A1

    专利类型

  • 公开/公告日2005-12-08

    原文格式PDF

  • 申请/专利权人 WON YONG CHOI;

    申请/专利号US20050523923

  • 发明设计人 WON YONG CHOI;

    申请日2002-08-08

  • 分类号G10L11/04;

  • 国家 US

  • 入库时间 2022-08-21 21:41:30

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号