首页> 外文会议>AES international conference >'Sparsification' of Audio Signals using the MDCT/lntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation
【24h】

'Sparsification' of Audio Signals using the MDCT/lntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation

机译:使用MDCT / lntMDCT和心理声学模型对音频信号进行“稀疏化”-在信息音频源分离中的应用

获取原文

摘要

Sparse representations have proved a very useful tool in a variety of domain, e.g. speech/music source separation. As strictly sparse representations (in the sense of e~0) are often impossible to achieve, other ways of studying signals sparsity have been proposed. In this paper, we revisit the irrelevance filtering analysis-synthesis approach proposed in (Balazs et al., IEEE Trans. ASLP, 18(1), 2010), where the TF coefficients that are below some masking threshold are set to zero. Instead of using the Gabor transform and a specific psychoacoustic model, we use tools directly inspired from perceptual audio coding, for instance MPEG-AAC. We show that significantly better "sparsification performances" are obtained on music signals, at lower computational cost. We then apply the sparsification process to the informed source separation (ISS) problem and show that it enables to significantly decrease the computational cost at the ISS decoder.
机译:稀疏表示已被证明在多种领域中都是非常有用的工具,例如语音/音乐来源分离。由于通常很难实现严格稀疏的表示(在e〜0的意义上),因此提出了研究信号稀疏性的其他方法。在本文中,我们将重新讨论(Balazs等,IEEE Trans。ASLP,18(1),2010)中提出的不相关滤波分析-合成方法,其中将低于某些掩蔽阈值的TF系数设置为零。代替使用Gabor变换和特定的心理声学模型,我们使用直接受感知音频编码启发的工具,例如MPEG-AAC。我们表明,以较低的计算成本,可以在音乐信号上获得明显更好的“分类性能”。然后,我们将稀疏化过程应用于知情源分离(ISS)问题,并证明它可以显着降低ISS解码器的计算成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号