A Phase Vocoder Based on Nonstationary Gabor Frames

Emil Solsbæk Ottosen; Monika Dörfler


Abstract

We propose a new algorithm for time stretching music signals based on the theory of nonstationary Gabor frames (NSGFs). The algorithm extends the techniques of the classical phase vocoder (PV) by incorporating adaptive time-frequency (TF) representations and adaptive phase locking. The adaptive TF representations imply good time resolution for the onsets of attack transients and good frequency resolution for the sinusoidal components. We estimate the phase values only at peak channels and the remaining phases are then locked to the values of the peaks in an adaptive manner. During attack transients we keep the stretch factor equal to one and we propose a new strategy for determining which channels are relevant for reinitializing the corresponding phase values. In contrast to previously published algorithms we use a non-uniform NSGF to obtain a low redundancy of the corresponding TF representation. We show that with just three times as many TF coefficients as signal samples, artifacts such as phasiness and transient smearing can be greatly reduced compared to the classical PV. The proposed algorithm is tested on both synthetic and real-world signals and compared with state-of-the-art algorithms in a reproducible manner.
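The peak-based phase handling mentioned in the abstract can be illustrated with a minimal sketch of the classical technique it builds on, often called identity phase locking, for a uniform STFT. This is not the adaptive NSGF scheme the authors propose. The function names (`find_peaks`, `princarg`, `pv_step`), the simple local-maximum peak picker, and the nearest-peak channel assignment are all assumptions made for illustration.

```python
import numpy as np

def find_peaks(mag):
    """Indices of channels whose magnitude exceeds both immediate neighbours."""
    k = np.arange(1, len(mag) - 1)
    return k[(mag[k] > mag[k - 1]) & (mag[k] > mag[k + 1])]

def princarg(phi):
    """Wrap phase values to the interval [-pi, pi)."""
    return (phi + np.pi) % (2 * np.pi) - np.pi

def pv_step(prev_ana, prev_syn, cur, hop_a, hop_s, n_fft):
    """One frame of a uniform-STFT phase vocoder with identity phase locking.

    prev_ana, prev_syn: analysis and synthesis phases of the previous frame.
    cur: complex spectrum of the current frame (e.g. from np.fft.rfft).
    hop_a, hop_s: analysis and synthesis hop sizes; hop_s / hop_a is the
    stretch factor. Returns (new spectrum, analysis phases, synthesis phases).
    """
    mag, ana = np.abs(cur), np.angle(cur)
    peaks = find_peaks(mag)
    if len(peaks) == 0:
        # Illustrative fallback: no peaks found, keep the analysis phases.
        return cur.copy(), ana, ana.copy()

    # Phases are estimated only at the peak channels.
    omega = 2 * np.pi * peaks / n_fft            # channel centre frequency, rad/sample
    dev = princarg(ana[peaks] - prev_ana[peaks] - omega * hop_a)
    inst_freq = omega + dev / hop_a              # instantaneous frequency estimate
    syn_at_peak = np.zeros(len(mag))
    syn_at_peak[peaks] = prev_syn[peaks] + inst_freq * hop_s

    # Every other channel is locked to its nearest peak: it keeps the
    # analysis phase difference to that peak.
    k = np.arange(len(mag))
    nearest = peaks[np.argmin(np.abs(k[:, None] - peaks[None, :]), axis=1)]
    syn = syn_at_peak[nearest] + ana - ana[nearest]
    return mag * np.exp(1j * syn), ana, syn
```

With `hop_s == hop_a` (stretch factor one) and synthesis phases initialized to the analysis phases, the step returns the input spectrum unchanged. This is the property the abstract relies on when it keeps the stretch factor equal to one during attack transients.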


Bibliographic record

Source
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2017, issue 11, pp. 2199-2208 (10 pages)
Authors
Emil Solsbæk Ottosen; Monika Dörfler
Affiliations

Department of Mathematical Sciences, Aalborg University, Aalborg, Denmark

Faculty of Mathematics, University of Vienna, Wien, Austria

Original format
PDF
Language
English
Keywords
Transient analysis; Time-frequency analysis; Manganese; Vocoders; Signal resolution; Redundancy; Speech
