Demystifying TasNet: A Dissecting Approach

机译：揭开TasNet神秘面纱：一种剖析方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In recent years time domain speech separation has excelled over frequency domain separation in single channel scenarios and noise-free environments. In this paper we dissect the gains of the time-domain audio separation network (TasNet) approach by gradually replacing components of an utterance-level permutation invariant training (u-PIT) based separation system in the frequency domain until the TasNet system is reached, thus blending components of frequency domain approaches with those of time domain approaches. Some of the intermediate variants achieve comparable signal-to-distortion ratio (SDR) gains to TasNet, but retain the advantage of frequency domain processing: compatibility with classic signal processing tools such as frequency-domain beamforming and the human interpretability of the masks. Furthermore, we show that the scale invariant signal-to-distortion ratio (si-SDR) criterion used as loss function in TasNet is related to a logarithmic mean square error criterion and that it is this criterion which contributes most reliable to the performance advantage of TasNet. Finally, we critically assess which gains in a noise-free single channel environment generalize to more realistic reverberant conditions.

机译：近年来，在单通道场景和无噪声环境中，时域语音分离优于频域分离。在本文中，我们通过逐步在频域中替换基于发声级置换不变训练（u-PIT）的分离系统的组件，直到达到TasNet系统，来剖析时域音频分离网络（TasNet）方法的收益，因此，将频域方法的组件与时域方法的组件混合在一起。某些中间变体可实现与TasNet相当的信噪比（SDR）增益，但保留了频域处理的优势：与经典信号处理工具（如频域波束形成）和口罩的人为解释性兼容。此外，我们表明，在TasNet中用作损耗函数的尺度不变信噪比（si-SDR）准则与对数均方误差准则有关，正是该准则对提高性能具有最可靠的贡献。 TasNet。最后，我们严格评估在无噪声单声道环境中哪些增益可以推广到更现实的混响条件。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2020年|6359-6363|共5页
会议地点
作者
Jens Heitkaemper; Darius Jakobeit; Christoph Boeddeker; Lukas Drude; Reinhold Haeb-Umbach;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
source separation; multichannel source separation; robust automatic speech recognition;

机译：源分离;多通道源分离;鲁棒的自动语音识别;
入库时间 2022-08-26 14:46:48

相似文献

外文文献
中文文献
专利

1. LINDA-BN: An interpretable probabilistic approach for demystifying black-box predictive models [J] . Moreira Catarina, Chou Yu-Liang, Velmurugan Mythreyi, Decision support systems . 2021,第Nova期

机译：Linda-BN：一种可解释的概率方法，用于搅滑黑匣子预测模型
2. Demystifying the brain: a computational approach [J] . Joao Luis G. Rosa Computing reviews . 2020,第2期

机译：揭开大脑神秘面纱：一种计算方法
3. Demystifying organisational embeddedness of leadership - a multi-method approach to validate a new construct [J] . Busse Ronald, Winnen Lothar, Wilms Rafael, Leadership and organization development journal . 2020,第2期

机译：DemyStify领导的组织嵌入性 - 一种多种方法方法来验证新的构建
4. Demystifying TasNet: A Dissecting Approach [C] . Jens Heitkaemper, Darius Jakobeit, Christoph Boeddeker, IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：DemyStify Tasnet：一个解剖方法
5. Demystifying the Commodification of Social Relations in the Ontario Child Protection System: A Marxist Approach to Textual Analysis. [D] . Preston, Susan Elizabeth. 2013

机译：安省儿童保护制度中社会关系商品化的神秘化：一种文本分析的马克思主义方法。
6. Competency-Based Approach in Training Nurses and Midwives in Morocco Demystify to Better Use [O] . Said Abouzaj 2019

机译：摩洛哥培训护士和助产士的以能力为基础的方法揭秘了更好的使用方法
7. Demystifying the pathway of assessment and treatment for bipolar disorder – utilising co-production and algorithms to personalise the approach [O] . Jessica Nicholls-Mindlin, Angus McLellan, David Gee, 2021

机译：揭开双相情感障碍评估和治疗的途径 - 利用共同生产和算法来个性化方法

Demystifying TasNet: A Dissecting Approach

摘要

著录项

相似文献

相关主题

期刊订阅