Data Augmentation for Monaural Singing Voice Separation Based on Variational Autoencoder-Generative Adversarial Network

机译：基于变分自编码-生成对抗网络的单声道歌声分离数据增强

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Random mixing and circularly shifting for augmenting the training set are used to improve the separation effect of deep neural network (DNN)-based monaural singing voice separation (MSVS). However, these manual methods are based on unrealistic assumptions that two sources in the mixture are independent of each other, which limits the separation effect. This paper proposes a data augmentation method based on variational autoencoder (VAE) and generative adversarial network (GAN), which is called as VAE-GAN. The VAE models the observed spectra of sources (vocal and music) separately and reconstructs new spectra from the latent space. The GAN's discriminator is introduced to measure the correlation between the latent variables of the vocal and music generated by the VAE probability encoder. This adversarial mechanism in VAE's latent space could learn the synthetic likelihood and ultimately decode high quality spectra samples, which further improves the separation effect of general MSVS networks.

机译：用于增强训练集的随机混合和圆形转移用于改善深神经网络（DNN）的单声道歌唱语音分离（MSV）的分离效果。然而，这些手动方法基于不切实际的假设，即混合物中的两个来源彼此独立，这限制了分离效果。本文提出了一种基于变分性AutoEncoder（VAE）和生成对抗网络（GaN）的数据增强方法，称为VAE-GaN。 VAE分别模拟了源（声音和音乐）的观察光谱，并从潜在空间重建了新的光谱。介绍了GaN的鉴别器来测量VAE概率编码器产生的声音和音乐的潜在变量之间的相关性。 VAE潜在空间中的这种对抗机制可以学习合成似然性，最终解码高质量的光谱样本，这进一步提高了通用MSVS网络的分离效果。

著录项

来源
《IEEE International Conference on Multimedia and Expo》|2019年|1354-1359|共6页
会议地点
作者
Boxin He; Shengbei Wang; Weitao Yuan; Jianming Wang; Masashi Unoki;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Gallium nitride; Training; Correlation; Decoding; Neural networks; Generators; Gaussian distribution;

机译：氮化镓;训练;相关;解码;神经网络;发电机;高斯分布;

相似文献

外文文献
中文文献
专利

1. Data Augmentation-Based Prediction of System Level Performance under Model and Parameter Uncertainties: Role of Designable Generative Adversarial Networks (DGAN) [J] . Yoo Yeongmin, Jung Ui-Jin, Han Yong Ha, Reliability Engineering & System Safety . 2021,第Feba期

机译：基于数据增强的模型和参数不确定性的系统级性能预测：可名称生成对策网络的作用（DGAN）
2. A data augmentation method based on cycle-consistent adversarial networks for fluorescence encoded microsphere image analysis [J] . Shi Zaifeng, Liu Minghe, Cao Qingjie, Signal processing . 2019,第AUGa期

机译：基于周期一致对抗网络的荧光编码微球图像分析数据增强方法
3. A data augmentation method based on cycle-consistent adversarial networks for fluorescence encoded microsphere image analysis [J] . Shi Zaifeng, Liu Minghe, Cao Qingjie, Signal processing . 2019,第Auga期

机译：一种基于循环一致对抗网络的荧光编码微球图像分析的数据增强方法
4. Data Augmentation for Monaural Singing Voice Separation Based on Variational Autoencoder-Generative Adversarial Network [C] . Boxin He, Shengbei Wang, Weitao Yuan, IEEE International Conference on Multimedia and Expo . 2019

机译：基于变分性自动化器 - 生成对抗网络的单型歌唱语音分离的数据增强
5. Data Augmentation for Supervised Learning with Generative Adversarial Networks [D] . Podduturi, Manaswi. 2018

机译：具有生成对抗网络的监督学习的数据增强
6. Seismic Data Augmentation Based on Conditional Generative Adversarial Networks [O] . Yuanming Li, Bonhwa Ku, Shou Zhang, 2020

机译：基于条件生成对抗网络的地震数据增强
7. Reducing Interference with Phase Recovery in DNN-based Monaural Singing Voice Separation [O] . Paul Magron, Konstantinos Drossos, Stylianos Ioannis Mimilakis, 2018

机译：减少基于DNN的单声道歌唱语音分离中相恢复的干扰

Data Augmentation for Monaural Singing Voice Separation Based on Variational Autoencoder-Generative Adversarial Network

摘要

著录项

相似文献

相关主题

期刊订阅