IEEE International Conference on Acoustics, Speech and Signal Processing

Deep Attractor Networks for Speaker Re-Identification and Blind Source Separation


Abstract

Deep clustering (DC) and deep attractor networks (DANs) are data-driven approaches to monaural blind source separation. Both approaches provide astonishing single-channel performance but have not yet been generalized to block-online processing. When separating speech in a continuous stream with a block-online algorithm, it must be determined in each block which of the output streams belongs to which speaker. In this contribution we solve this block permutation problem by introducing an additional speaker identification embedding into the DAN model structure. We motivate this model decision by analyzing the embedding topology of DC and DANs and show that DC and DANs by themselves are not sufficient for speaker identification. This model structure (a) improves the signal-to-distortion ratio (SDR) over a DAN baseline and (b) provides up to 61% and up to 34% relative reduction in permutation error rate and re-identification error rate, respectively, compared to an i-vector baseline.
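The block permutation problem described in the abstract can be illustrated with a minimal sketch: each processing block produces one embedding per output stream, and streams are assigned to speakers by matching them against running reference embeddings. This is not the paper's implementation; the function names and the use of cosine similarity with a brute-force search over permutations are assumptions for illustration only.

```python
import itertools
import math

def cosine(u, v):
    """Cosine similarity between two embedding vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    nu = math.sqrt(sum(a * a for a in u))
    nv = math.sqrt(sum(b * b for b in v))
    return dot / (nu * nv)

def align_block(ref_embs, block_embs):
    """Resolve the block permutation problem for one block (hypothetical
    helper): return the permutation of this block's output streams that
    maximizes total cosine similarity to the running per-speaker
    reference embeddings. perm[i] = index of the block stream assigned
    to reference speaker i."""
    best_perm, best_score = None, -math.inf
    for perm in itertools.permutations(range(len(block_embs))):
        score = sum(cosine(ref_embs[i], block_embs[j])
                    for i, j in enumerate(perm))
        if score > best_score:
            best_perm, best_score = perm, score
    return best_perm

# Two reference speakers; the new block emitted its streams swapped.
refs = [[1.0, 0.0], [0.0, 1.0]]
block = [[0.1, 0.9], [0.9, 0.2]]
print(align_block(refs, block))  # → (1, 0): stream 1 is speaker 0
```

For two or three speakers the brute-force search over permutations is negligible; for larger numbers of streams a Hungarian-algorithm assignment (e.g. `scipy.optimize.linear_sum_assignment`) would replace the factorial loop.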
