...
首页> 外文期刊>The Journal of the Acoustical Society of America >Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation
【24h】

Cochleagram-based audio pattern separation using two-dimensional non-negative matrix factorization with automatic sparsity adaptation

机译:使用自动稀疏度自适应的二维非负矩阵分解实现基于耳蜗的音频模式分离

获取原文
获取原文并翻译 | 示例
           

摘要

An unsupervised single channel audio separation method from pattern recognition viewpoint is presented. The proposed method does not require training knowledge and the separation system is based on non-uniform time-frequency (TF) analysis and feature extraction. Unlike conventional research that concentrates on the use of spectrogram or its variants, the proposed separation algorithm uses an alternative TF representation based on the gammatone filterbank. In particular, the monaural mixed audio signal is shown to be considerably more separable in this non-uniform TF domain. The analysis of signal separability to verify this finding is provided. In addition, a variational Bayesian approach is derived to learn the sparsity parameters for optimizing the matrix factorization. Experimental tests have been conducted, which show that the extraction of the spectral dictionary and temporal codes is more efficient using sparsity learning and subsequently leads to better separation performance.
机译:从模式识别的角度提出了一种无监督的单通道音频分离方法。所提出的方法不需要训练知识,并且分离系统基于不均匀的时频(TF)分析和特征提取。与专注于使用频谱图或其变体的常规研究不同,所提出的分离算法使用基于伽马通滤波器组的替代TF表示。特别是,单声道混合音频信号在该非均匀TF域中显示为可分离得多。提供了信号可分离性分析,以验证这一发现。此外,派生出一种变分贝叶斯方法来学习稀疏参数,以优化矩阵分解。已经进行了实验测试,表明使用稀疏学习对频谱字典和时间码的提取更为有效,并因此导致更好的分离性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号