Examining the Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation

机译：检查替代目标函数对基于深度学习的音乐源分离的感知效果

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this study, we examine the effect of various objective functions used to optimize the recently proposed deep learning architecture for singing voice separation MaD - Masker and Denoiser. The parameters of the MaD architecture are optimized using an objective function that contains a reconstruction criterion between predicted and true magnitude spectra of the singing voice, and a regularization term. We examine various reconstruction criteria such as the generalized Kullback-Leibler, mean squared error, and noise to mask ratio. We also explore recently proposed, for optimizing MaD, regularization terms such as sparsity and TwinNetwork regularization. Results from both objective assessment and listening tests suggest that the TwinNetwork regularization results in improved singing voice separation quality.

机译：在这项研究中，我们研究了各种目标函数的效果，这些目标函数用于优化最近提出的用于唱歌语音分离MaD的深度学习架构-Masker和Denoiser。使用目标函数对MaD架构的参数进行优化，该目标函数包含在演唱声音的预测幅度和真实幅度谱之间的重建标准以及正则项。我们研究了各种重建标准，例如广义的Kullback-Leibler，均方误差和噪声与掩模比率。我们还探索了最近提出的用于优化MaD的正则化术语，例如稀疏性和TwinNetwork正则化。客观评估和听力测试的结果均表明，TwinNetwork正则化可提高歌声分离质量。

著录项

来源
《2018 52nd Asilomar Conference on Signals, Systems, and Computers》|2018年|679-683|共5页
会议地点 Pacific Grove(US)
作者
Stylianos Ioannis Mimilakis; Estefanía Cano; Derry FitzGerald; Konstantinos Drossos; Gerald Schuller;
展开▼
作者单位

Fraunhofer IDMT, Ilmenau, Germany;

Fraunhofer IDMT, Ilmenau, Germany;

AudioSourceRE, Cork, Ireland;

Laboratory of Signal Processing, Tampere University of Technology, Tampere, Finland;

Technical University of Ilmenau, Ilmenau, Germany;

展开▼
会议组织
原文格式 PDF
正文语种 eng
中图分类
关键词
Linear programming; Source separation; Measurement; Computer architecture; Deep learning; Task analysis; Neural networks;

机译：线性规划;源分离;测量;计算机体系结构;深度学习;任务分析;神经网络;;

相似文献

外文文献
中文文献
专利

1. Perceptually enhanced blind single-channel music source separation by Non-negative Matrix Factorization [J] . Kirbiz S., Günsel B. Digital Signal Processing . 2013,第2期

机译：非负矩阵分解可感知地增强盲单通道音乐源分离
2. Musical Part Separation Based on Perceptual Hierarchy [J] . Tomoyoshi Kinoshita, Ibuki Handa, Makoto Muto, Systems and Computers in Japan . 2007,第2期

机译：基于感知层次的音乐部分分离
3. On the conditions for valid objective functions in blind separation of independent and dependent sources [J] . Cesar F Caiafa EURASIP journal on advances in signal processing . 2012,第1期

机译：关于独立和依赖源盲分离中有效目标函数的条件
4. Examining the Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation [C] . Stylianos Ioannis Mimilakis, Estefanía Cano, Derry FitzGerald, Asilomar Conference on Signals, Systems, and Computers . 2018

机译：检查基于深度学习的音乐源分离的替代客观函数的感知效果
5. Examining experiences of teaching music to a child with autism while using a music learning theory-based intervention during informal music sessions infused with DIR/Floortime strategies [D] . Griffith, Claire E. 2009

机译：审查在自律儿童中教音乐的经验，同时在非正式音乐课中使用基于音乐学习理论的干预措施，并灌输DIR / Floortime策略
6. Modeling and Forecasting the GPS Zenith Troposphere Delay in West Antarctica Based on Different Blind Source Separation Methods and Deep Learning [O] . Qingchuan Zhang, Fei Li, Shengkai Zhang, 2020

机译：基于不同盲源分离方法和深度学习的南极GPS天顶对流层延迟建模与预测
7. On Training Targets and Objective Functions for Deep-learning-based Audio-visual Speech Enhancement [O] . Daniel Michelsanti, Zheng-Hua Tan, Sigurdur Sigurdsson, 2019

机译：基于深度学习的视听语音增强的培训目标和客观函数
8. Neural and Computational Mechanisms of Perceptual Decisions Between Multiple Alternatives Based on Multiple Sources of Evidence. [R] . Ditterich, J. 2010

机译：基于多源证据的多种方案感知决策的神经和计算机制。

Examining the Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation

摘要

著录项

相似文献

相关主题

期刊订阅