Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation

Drude Lukas; Haeb-Umbach Reinhold

首页> 外文期刊>Selected Topics in Signal Processing, IEEE Journal of >Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation

【24h】

Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation

机译：神经网络和概率空间模型的集成，用于声盲源分离

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

We formulate a generic framework for blind source separation (BSS), which allows integrating data-driven spectro-temporal methods, such as deep clustering and deep attractor networks, with physically motivated probabilistic spatial methods, such as complex angular central Gaussian mixture models. The integrated model exploits the complementary strengths of the two approaches to BSS: the strong modeling power of neural networks, which, however, is based on supervised learning, and the ease of unsupervised learning of the spatial mixture models whose few parameters can be estimated on as little as a single segment of a real mixture of speech. Experiments are carried out on both artificially mixed speech and true recordings of speech mixtures. The experiments verify that the integrated models consistently outperform the individual components. We further extend the models to cope with noisy, reverberant speech and introduce a cross-domain teacher-student training where the mixture model serves as the teacher to provide training targets for the student neural network.

机译：我们制定了用于盲源分离（BSS）的通用框架，该框架允许将数据驱动的光谱时空方法（例如深度聚类和深度吸引子网络）与物理动机概率空间方法（例如复杂的中心高斯混合角函数模型）集成在一起。集成模型利用了BSS两种方法的互补优势：神经网络的强大建模能力（然而，这是基于监督学习的）以及对空间混合模型进行无监督学习的简便性，该模型几乎无法估计任何参数少至真实语音混合的单个片段。对人工混合的语音和语音混合的真实录音都进行了实验。实验证明，集成模型始终优于单个组件。我们进一步扩展了模型以应对嘈杂的，混响的语音，并引入了跨域师生训练，其中混合模型充当老师，为学生神经网络提供训练目标。

著录项

来源
《Selected Topics in Signal Processing, IEEE Journal of》 |2019年第4期|815-826|共12页
作者
Drude Lukas; Haeb-Umbach Reinhold;
展开▼
作者单位

Paderborn Univ Commun Engn Grp D-33098 Paderborn Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Blind source separation; speech processing; beam-forming; deep clustering; neural networks; teacher-student;

机译：盲源分离;语音处理;波束成形深度集群;神经网络;师生;

相似文献

外文文献
中文文献
专利

1. Robust neural networks with on-line learning for blind identification and blind separation of sources [J] . Cichocki A., Unbehauen R. IEEE Transactions on Circuits and Systems. 1 . 1996,第11期

机译：具有在线学习功能的鲁棒神经网络，用于盲识别和盲分离源
2. Neural networks for blind separation with unknown number of sources [J] . andrzej Cichocki, Juha Karhunen, Wlodzimierz Kasprzak Neurocomputing . 1999,第1a3期

机译：未知来源的盲目分离神经网络
3. Robust neural networks with on-line learning for blindidentification and blind separation of sources [J] . Cichocki A., Unbehauen R. IEEE Transactions on Circuits and Systems. I, Regular Papers . 1996,第11期

机译：具有在线学习功能的强大神经网络，用于盲目识别和盲目分离源
4. BLIND SOURCE SEPARATION WITH NEURAL NETWORKS: DEMISING SOURCES FROM MIXTURES WITH DIFFERENT PARAMETERS [C] . Iren Valova, Natacha Gueorguieva, Georgi Georgiev IEEE/AIAA Digital Avionics Systems Conference . 2006

机译：与神经网络的盲来源分离：从不同参数的混合物中脱离源
5. Acoustic Reflector Localisation for Blind Source Separation and Spatial Audio [D] . Remaggi, Luca. 2017

机译：声反射器定位，用于盲源分离和空间音频
6. Integrating probabilistic models of perception and interactive neural networks: a historical and tutorial review [O] . James L. McClelland 2013

机译：整合感知和交互式神经网络的概率模型：历史和教程评论
7. Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation [O] . Lukas Drude, Reinhold Haeb-Umbach 2019

机译：神经网络集成和声学盲源分离的概率空间模型
8. Neural Networks for Blind Separation with Unknown Number of Sources [R] . Cichocki, A., Karhunen, J., Kasprzak, W., 1998

机译：具有未知源数的盲分离神经网络

Integration of Neural Networks and Probabilistic Spatial Models for Acoustic Blind Source Separation

摘要

著录项

相似文献

相关主题

期刊订阅