Exploiting the harmonic structure for speech enhancement

Abstract

We present a single-channel speech enhancement method that leverages the harmonic structure of voiced speech. A sinusoidal model, based on the pitch of the speaker, is used to filter noisy speech and remove noise components that lie between the harmonics. To remove noise that lies on the harmonic frequencies themselves, we use a noise estimation procedure that exploits the spectral sparsity of voiced speech. By measuring the power spectrum at frequencies that correspond to the zero crossings of the windowing function, we can estimate the noise level even in frames that contain voiced speech. We also provide a constrained linear least-squares formulation to reduce the "musical noise" that arises from the difficulty of estimating the speech and noise power spectral densities. We show that our method yields higher perceptual performance than existing methods and adapts easily to conditions in which the noise characteristics change constantly.
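The noise estimation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure; it only shows why spectral nulls of the analysis window allow noise to be measured while voiced speech is present. The assumptions here are ours: a rectangular window, a known pitch, a frame that spans an integer number of pitch periods, and white noise. Under those assumptions every harmonic falls exactly on a DFT bin and contributes nothing to the remaining bins, so the power measured there is noise only. The function name `noise_power_at_window_nulls` and all signal parameters are hypothetical.

```python
import numpy as np


def noise_power_at_window_nulls(frame, periods_in_frame):
    """Estimate noise variance from DFT bins where the harmonics vanish.

    Assumes a rectangular window and a frame holding an integer number
    of pitch periods, so harmonics sit on bins that are multiples of
    `periods_in_frame` and leak nothing into the other bins.
    """
    n = len(frame)
    power = np.abs(np.fft.rfft(frame)) ** 2 / n   # periodogram
    bins = np.arange(len(power))
    null_bins = bins[bins % periods_in_frame != 0]  # skip harmonic bins
    return power[null_bins].mean()


# Synthetic voiced frame (illustrative values, not taken from the paper).
fs = 8000          # sampling rate in Hz
f0 = 200           # pitch in Hz, so one period is 40 samples
n = 320            # 40 ms frame = 8 pitch periods
periods = n * f0 // fs

t = np.arange(n) / fs
rng = np.random.default_rng(0)
phases = rng.uniform(0, 2 * np.pi, size=19)
voiced = sum(
    np.cos(2 * np.pi * h * f0 * t + phases[h - 1]) / h
    for h in range(1, 20)          # harmonics below the Nyquist frequency
)

noise_var = 0.01
noisy = voiced + rng.normal(0.0, np.sqrt(noise_var), size=n)

clean_estimate = noise_power_at_window_nulls(voiced, periods)
noisy_estimate = noise_power_at_window_nulls(noisy, periods)
naive_estimate = np.mean(np.abs(np.fft.rfft(noisy)) ** 2 / n)  # all bins

print(f"null-bin estimate on clean frame: {clean_estimate:.2e}")
print(f"null-bin estimate on noisy frame: {noisy_estimate:.4f}")
print(f"all-bin average on noisy frame:   {naive_estimate:.4f}")
```

On the clean frame the null-bin estimate is zero up to rounding error, while on the noisy frame it tracks the noise variance even though speech energy dominates the frame. In practice the pitch has to be estimated and the frame rarely holds an exact integer number of periods, so a real system would need to evaluate the spectrum at the window nulls relative to the estimated harmonic frequencies rather than at fixed DFT bins.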
