首页> 外文会议>2016 IEEE International Workshop on Acoustic Signal Enhancement >Dynamic group sparsity for non-negative matrix factorization with application to unsupervised source separation
【24h】

Dynamic group sparsity for non-negative matrix factorization with application to unsupervised source separation

机译:用于非负矩阵分解的动态群稀疏性及其在无监督源分离中的应用

获取原文
获取原文并翻译 | 示例

摘要

Non-negative matrix factorization (NMF) is an appealing technique for audio source separation. Sparsity constraints are commonly used on the NMF model to discover a small number of dominant patterns. Recently, group sparsity has been proposed for NMF based methods, in which basis vectors belonging to a same group are permitted to activate together, while activations across groups are suppressed. However, most group sparsity functions activate the groups in a global manner without considering the dynamics of the speech spectra in different frames. In this paper, we propose dynamic group sparsity to model both the spectral dynamics and the temporal continuity of the speech signal and investigate its potential benefit to separate speech from other sound sources. Experimental results show that the proposed dynamic group sparsity improves the performance over regular group sparsity in unsupervised settings where neither the speaker identity nor the type of noise is known in advance.
机译:非负矩阵分解(NMF)是一种用于音频源分离的吸引人的技术。 NMF模型通常使用稀疏约束来发现少量主导模式。最近,已经提出了基于NMF的方法的组稀疏性,其中允许属于同一组的基础向量一起激活,而跨组的激活被抑制。但是,大多数组稀疏函数以全局方式激活组,而不考虑不同帧中语音频谱的动态。在本文中,我们提出了动态群稀疏性来对语音信号的频谱动力学和时间连续性进行建模,并研究其将语音与其他声源分离的潜在好处。实验结果表明,在无人看管的情况下,提出的动态组稀疏度比常规的组稀疏度提高了性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号