首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On Vae-Based Metric Learning
【24h】

Pitch-Timbre Disentanglement Of Musical Instrument Sounds Based On Vae-Based Metric Learning

机译:基于VAE的公制学习的音乐仪器声音的俯仰 - TIMBRE解剖

获取原文

摘要

This paper describes a representation learning method for disentangling an arbitrary musical instrument sound into latent pitch and timbre representations. Although such pitch-timbre disentanglement has been achieved with a variational autoencoder (VAE), especially for a predefined set of musical instruments, the latent pitch and timbre representations are outspread, making them hard to interpret. To mitigate this problem, we introduce a metric learning technique into a VAE with latent pitch and timbre spaces so that similar (different) pitches or timbres are mapped close to (far from) each other. Specifically, our VAE is trained with additional contrastive losses so that the latent distances between two arbitrary sounds of the same pitch or timbre are minimized, and those of different pitches or timbres are maximized. This training is performed under weak supervision that uses only whether the pitches and timbres of two sounds are the same or not, instead of their actual values. This improves the generalization capability for unseen musical instruments. Experimental results show that the proposed method can find better-structured disentangled representations with pitch and timbre clusters even for unseen musical instruments.
机译:本文介绍了解开任意乐器声音的代表学习方法,进入潜在音高和Timbre表示。尽管使用变形的AutoEncoder(VAE)已经实现了这种俯仰肌腱脱离,但特别是对于预定义的乐器组,潜在的沥青和Timbre表示是未应用的,使它们难以解释。为了缓解此问题,我们将公制学习技术引入潜在俯仰和Timbre空间的VAE,以便与彼此相似的(不同)的音高或Timbres映射到彼此接近(远离)。具体而言,我们的VAE被训练,具有额外的对比损失,使得最小化相同间距或Timbre的两个任意声音之间的潜距离,并且不同间距或摩擦的距离最大化。此培训在弱监管下执行,仅使用两个声音的音高和Timbres是相同的,而不是它们的实际值。这改善了看不见的乐器的泛化能力。实验结果表明,即使针对看不见的乐器,所提出的方法也可以找到具有音高和Timbre集群的更好地结构的解剖表现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号