首页> 外文期刊>Audio, Speech, and Language Processing, IEEE/ACM Transactions on >A Phase Vocoder Based on Nonstationary Gabor Frames
【24h】

A Phase Vocoder Based on Nonstationary Gabor Frames

机译:基于非平稳Gabor帧的相位声码器

获取原文
获取原文并翻译 | 示例

摘要

We propose a new algorithm for time stretching music signals based on the theory of nonstationary Gabor frames (NSGFs). The algorithm extends the techniques of the classical phase vocoder (PV) by incorporating adaptive time-frequency (TF) representations and adaptive phase locking. The adaptive TF representations imply good time resolution for the onsets of attack transients and good frequency resolution for the sinusoidal components. We estimate the phase values only at peak channels and the remaining phases are then locked to the values of the peaks in an adaptive manner. During attack transients we keep the stretch factor equal to one and we propose a new strategy for determining which channels are relevant for reinitializing the corresponding phase values. In contrast to previously published algorithms we use a non-uniform NSGF to obtain a low redundancy of the corresponding TF representation. We show that with just three times as many TF coefficients as signal samples, artifacts such as phasiness and transient smearing can be greatly reduced compared to the classical PV. The proposed algorithm is tested on both synthetic and real-world signals and compared with state-of-the-art algorithms in a reproducible manner.
机译:我们提出了一种基于非平稳Gabor帧(NSGF)理论的时间拉伸音乐信号的新算法。该算法通过结合自适应时频(TF)表示和自适应锁相,扩展了经典相位声码器(PV)的技术。自适应TF表示对于攻击瞬变的发作意味着良好的时间分辨率,对于正弦分量意味着良好的频率分辨率。我们仅在峰值通道处估计相位值,然后以自适应方式将其余相位锁定到峰值。在攻击瞬变期间,我们将拉伸因子保持等于1,并提出了一种新的策略,用于确定哪些通道与重新初始化相应的相位值相关。与以前发布的算法相反,我们使用非均匀NSGF来获得相应TF表示的低冗余。我们证明,与传统的PV相比,TF系数仅为信号样本的三倍,可以大大减少诸如相位和瞬时拖尾等伪影。所提出的算法在合成信号和现实信号上均经过测试,并以可重复的方式与最新算法进行比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号