首页> 外文会议>2018 52nd Asilomar Conference on Signals, Systems, and Computers >Examining the Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation
【24h】

Examining the Perceptual Effect of Alternative Objective Functions for Deep Learning Based Music Source Separation

机译:检查替代目标函数对基于深度学习的音乐源分离的感知效果

获取原文
获取原文并翻译 | 示例

摘要

In this study, we examine the effect of various objective functions used to optimize the recently proposed deep learning architecture for singing voice separation MaD - Masker and Denoiser. The parameters of the MaD architecture are optimized using an objective function that contains a reconstruction criterion between predicted and true magnitude spectra of the singing voice, and a regularization term. We examine various reconstruction criteria such as the generalized Kullback-Leibler, mean squared error, and noise to mask ratio. We also explore recently proposed, for optimizing MaD, regularization terms such as sparsity and TwinNetwork regularization. Results from both objective assessment and listening tests suggest that the TwinNetwork regularization results in improved singing voice separation quality.
机译:在这项研究中,我们研究了各种目标函数的效果,这些目标函数用于优化最近提出的用于唱歌语音分离MaD的深度学习架构-Masker和Denoiser。使用目标函数对MaD架构的参数进行优化,该目标函数包含在演唱声音的预测幅度和真实幅度谱之间的重建标准以及正则项。我们研究了各种重建标准,例如广义的Kullback-Leibler,均方误差和噪声与掩模比率。我们还探索了最近提出的用于优化MaD的正则化术语,例如稀疏性和TwinNetwork正则化。客观评估和听力测试的结果均表明,TwinNetwork正则化可提高歌声分离质量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号