首页> 外文会议>2017 IEEE 27th International Workshop on Machine Learning for Signal Processing >Infinite probabilistic latent component analysis for audio source separation
【24h】

Infinite probabilistic latent component analysis for audio source separation

机译:无限概率潜在成分分析用于音频源分离

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a statistical method of audio source separation based on a nonparametric Bayesian extension of probabilistic latent component analysis (PLCA). A major approach to audio source separation is to use nonnegative matrix factorization (NMF) that approximates the magnitude spectrum of a mixture signal at each frame as the weighted sum of fewer source spectra. Another approach is to use PLCA that regards the magnitude spectrogram as a two-dimensional histogram of “sound quanta” and classifies each quantum into one of sources. While NMF has a physically-natural interpretation, PLCA has been used successfully for music signal analysis. To enable PLCA to estimate the number of sources, we propose Dirichlet process PLCA (DP-PLCA) and derive two kinds of learning methods based on variational Bayes and collapsed Gibbs sampling. Unlike existing learning methods for nonparametric Bayesian NMF based on the beta or gamma processes (BP-NMF and GaP-NMF), our sampling method can efficiently search for the optimal number of sources without truncating the number of sources to be considered. Experimental results showFed that DP-PLCA is superior to GaP-NMF in terms of source number estimation.
机译:本文提出了一种基于概率潜在成分分析(PLCA)的非参数贝叶斯扩展的音频源分离统计方法。音频源分离的一种主要方法是使用非负矩阵分解(NMF),该矩阵将每个帧处混合信号的幅度谱近似为较少源谱的加权和。另一种方法是使用PLCA,将幅度谱图视为“声量子”的二维直方图,并将每个量子分类为一个源。虽然NMF具有自然的解释,但PLCA已成功用于音乐信号分析。为了使PLCA能够估计来源数量,我们提出了Dirichlet过程PLCA(DP-PLCA),并基于变分贝叶斯和折叠Gibbs采样推导了两种学习方法。与基于beta或gamma过程(BP-NMF和GaP-NMF)的非参数贝叶斯NMF的现有学习方法不同,我们的采样方法可以有效地搜索最佳源数,而不会缩短要考虑的源数。实验结果表明,就源数估计而言,DP-PLCA优于GaP-NMF。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号