...
首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation
【24h】

Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation

机译:神经网络和概率空间模型的集成,用于声盲源分离

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We formulate a generic framework for blind source separation (BSS), which allows integrating data-driven spectro-temporal methods, such as deep clustering and deep attractor networks, with physically motivated probabilistic spatial methods, such as complex angular central Gaussian mixture models. The integrated model exploits the complementary strengths of the two approaches to BSS: the strong modeling power of neural networks, which, however, is based on supervised learning, and the ease of unsupervised learning of the spatial mixture models whose few parameters can be estimated on as little as a single segment of a real mixture of speech. Experiments are carried out on both artificially mixed speech and true recordings of speech mixtures. The experiments verify that the integrated models consistently outperform the individual components. We further extend the models to cope with noisy, reverberant speech and introduce a cross-domain teacher-student training where the mixture model serves as the teacher to provide training targets for the student neural network.
机译:我们制定了用于盲源分离(BSS)的通用框架,该框架允许将数据驱动的光谱时空方法(例如深度聚类和深度吸引子网络)与物理动机概率空间方法(例如复杂的中心高斯混合角函数模型)集成在一起。集成模型利用了BSS两种方法的互补优势:神经网络的强大建模能力(然而,这是基于监督学习的)以及对空间混合模型进行无监督学习的简便性,该模型几乎无法估计任何参数少至真实语音混合的单个片段。对人工混合的语音和语音混合的真实录音都进行了实验。实验证明,集成模型始终优于单个组件。我们进一步扩展了模型以应对嘈杂的,混响的语音,并引入了跨域师生训练,其中混合模型充当老师,为学生神经网络提供训练目标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号