首页> 外文会议>AES international conference >'Sparsification' of Audio Signals using the MDCT/lntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation

【24h】

'Sparsification' of Audio Signals using the MDCT/lntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation

机译：使用MDCT / lntMDCT和心理声学模型对音频信号进行“稀疏化”-在信息音频源分离中的应用

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Sparse representations have proved a very useful tool in a variety of domain, e.g. speech/music source separation. As strictly sparse representations (in the sense of e~0) are often impossible to achieve, other ways of studying signals sparsity have been proposed. In this paper, we revisit the irrelevance filtering analysis-synthesis approach proposed in (Balazs et al., IEEE Trans. ASLP, 18(1), 2010), where the TF coefficients that are below some masking threshold are set to zero. Instead of using the Gabor transform and a specific psychoacoustic model, we use tools directly inspired from perceptual audio coding, for instance MPEG-AAC. We show that significantly better "sparsification performances" are obtained on music signals, at lower computational cost. We then apply the sparsification process to the informed source separation (ISS) problem and show that it enables to significantly decrease the computational cost at the ISS decoder.

机译：稀疏表示已被证明在多种领域中都是非常有用的工具，例如语音/音乐来源分离。由于通常很难实现严格稀疏的表示（在e〜0的意义上），因此提出了研究信号稀疏性的其他方法。在本文中，我们将重新讨论（Balazs等，IEEE Trans。ASLP，18（1），2010）中提出的不相关滤波分析-合成方法，其中将低于某些掩蔽阈值的TF系数设置为零。代替使用Gabor变换和特定的心理声学模型，我们使用直接受感知音频编码启发的工具，例如MPEG-AAC。我们表明，以较低的计算成本，可以在音乐信号上获得明显更好的“分类性能”。然后，我们将稀疏化过程应用于知情源分离（ISS）问题，并证明它可以显着降低ISS解码器的计算成本。

著录项

来源
《AES international conference》|2011年|p.179-187|共9页
会议地点
作者
Jonathan Pinel; Laurent Girin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类声学工程;
关键词

相似文献

外文文献
中文文献
专利

1. A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor [J] . Parvaix M., Girin L., Brossier J.-M. Audio, Speech, and Language Processing, IEEE Transactions on . 2010,第6期

机译：基于水印的单传感器音频信号信息源分离方法
2. Informed Source Separation of Linear Instantaneous Under-Determined Audio Mixtures by Source Index Embedding [J] . Parvaix M., Girin L. Audio, Speech, and Language Processing, IEEE Transactions on . 2011,第6期

机译：通过源索引嵌入对线性瞬时欠定混合音频进行知情源分离
3. [Invited Talk] Statistical Representation of Binaural Signal and Its Application to Audio Source Separation [J] . Hiroshi SARUWATARI 電子情報通信学会技術研究報告. 応用音響. Engineering Acoustics . 2015,第126期

机译：[特邀演讲]双耳信号的统计表示及其在音频源分离中的应用
4. "Sparsification" of Audio Signals using the MDCT/IntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation [C] . Jonathan Pinel, Laurent Girin Audio Engineering Society International Conference . 2011

机译：使用MDCT / INTMDCT和精神声学模型的音频信号的“稀疏” - 应用于通知音频源分离
5. A modified extended Kalman filter approach to demodulation of AM-FM signals and its applications to audio and speech signals. [D] . Pai, Wan-Chieh. 1998

机译：一种改进的扩展卡尔曼滤波器方法，用于解调AM-FM信号及其在音频和语音信号中的应用。
6. pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis [O] . Theodoros Giannakopoulos 2010

机译：pyAudioAnalysis：用于音频信号分析的开源Python库
7. A Watermarking-Based Method for Informed Source Separation of Audio Signals with a Single Sensor [O] . Parvaix, Mathieu, Girin, Laurent, Brossier, Jean-Marc 2010

机译：基于水印的单传感器音频信号信息源分离方法
8. Audio Signal Separation Using Blind Source Separation Techniques [R] . Kadambe, S., Owechko, Y. 1999

机译：采用盲源分离技术的音频信号分离

'Sparsification' of Audio Signals using the MDCT/lntMDCT and a Psychoacoustic Model - Application to Informed Audio Source Separation

摘要

著录项

相似文献

相关主题

期刊订阅