首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Semi-supervised Monaural Singing Voice Separation with a Masking Network Trained on Synthetic Mixtures
【24h】

Semi-supervised Monaural Singing Voice Separation with a Masking Network Trained on Synthetic Mixtures

机译:半监督单次唱歌语音分离,屏蔽网络培养了合成混合物

获取原文

摘要

We study the problem of semi-supervised singing voice separation, in which the training data contains a set of samples of mixed music (singing and instrumental) and an unmatched set of instrumental music. Our solution employs a single mapping function g, which, applied to a mixed sample, recovers the underlying instrumental music, and, applied to an instrumental sample, returns the same sample. The network g is trained using purely instrumental samples, as well as on synthetic mixed samples that are created by mixing reconstructed singing voices with random instrumental samples. Our results indicate that we are on a par with or better than fully supervised methods, which are also provided with training samples of unmixed singing voices, and are better than other recent semi-supervised methods.
机译:我们研究了半监督歌唱语音分离的问题,其中培训数据包含一组混合音乐样本(唱歌和乐器)和无与伦比的乐器音乐。我们的解决方案采用单个映射函数G,其应用于混合样品,恢复底层的乐器音乐,并应用于乐器样本,返回相同的样本。网络G使用纯粹的仪器样本培训,以及通过将重建的歌声与随机仪器样品混合来创建的合成混合样品。我们的结果表明,我们与完全监督的方法相提并论,也可以提供未混合的唱歌声音的培训样本,并且比其他最近的半监督方法更好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号