首页> 外文期刊>IEEE Transactions on Speech and Audio Proceeding >A perceptual subspace approach for modeling of speech and audio signals with damped sinusoids
【24h】

A perceptual subspace approach for modeling of speech and audio signals with damped sinusoids

机译:具有阻尼正弦曲线的语音和音频信号建模的感知子空间方法

获取原文
获取原文并翻译 | 示例
       

摘要

The problem of modeling a signal segment as a sum of exponentially damped sinusoidal components arises in many different application areas, including speech and audio processing. Often, model parameters are estimated using subspace based techniques which arrange the input signal in a structured matrix and exploit the so-called shift-invariance property related to certain vector spaces of the input matrix. A problem with this class of estimation algorithms, when used for speech and audio processing, is that the perceptual importance of the sinusoidal components is not taken into account. In this work we propose a solution to this problem. In particular, we show how to combine well-known subspace based estimation techniques with a recently developed perceptual distortion measure, in order to obtain an algorithm for extracting perceptually relevant model components. In analysis-synthesis experiments with wideband audio signals, objective and subjective evaluations show that the proposed algorithm improves perceived signal quality considerable over traditional subspace based analysis methods.
机译:将信号段建模为指数阻尼正弦分量之和的问题出现在许多不同的应用领域,包括语音和音频处理。通常,使用基于子空间的技术来估计模型参数,这些技术将输入信号排列在结构化的矩阵中,并利用与输入矩阵的某些向量空间有关的所谓的位移不变性。当用于语音和音频处理时,此类估计算法的问题在于,正弦分量的感知重要性未得到考虑。在这项工作中,我们提出了解决该问题的方法。特别是,我们展示了如何将众所周知的基于子空间的估计技术与最近开发的感知失真度量相结合,以获取用于提取感知相关模型分量的算法。在宽带音频信号的分析综合实验中,客观和主观评估表明,与传统的基于子空间的分析方法相比,该算法可显着提高感知信号的质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号