首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects
【24h】

Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects

机译:自动多木混合与神经音频效应的可微分混合控制台

获取原文

摘要

Applications of deep learning to automatic multitrack mixing are largely unexplored. This is partly due to the limited available data, coupled with the fact that such data is relatively unstructured and variable. To address these challenges, we propose a domain-inspired model with a strong inductive bias for the mixing task. We achieve this with the application of pre-trained sub-networks and weight sharing, as well as with a sum/difference stereo loss function. The proposed model can be trained with a limited number of examples, is permutation invariant with respect to the input ordering, and places no limit on the number of input sources. Furthermore, it produces human-readable mixing parameters, allowing users to manually adjust or refine the generated mix. Results from a perceptual evaluation involving audio engineers indicate that our approach generates mixes that outperform baseline approaches. To the best of our knowledge, this work demonstrates the first approach in learning multitrack mixing conventions from real-world data at the waveform level, without knowledge of the underlying mixing parameters.
机译:深度学习对自动多条混合的应用主要是未开发的。这部分是由于有限的可用数据,与此类数据相对非结构化和变量相结合。为了解决这些挑战,我们提出了一个具有强烈归纳偏见的域激发模型,用于混合任务。我们通过应用预先训练的子网和重量共享以及总和/差异立体声损耗功能来实现这一目标。所提出的模型可以用有限数量的示例训练,是相对于输入排序的置换不变,并且没有限制输入源的数量。此外,它产生人类可读的混合参数,允许用户手动调整或细化所产生的混合物。涉及音频工程师的感知评估的结果表明我们的方法产生了优于基线方法的混合。据我们所知,这项工作展示了在波形水平的真实数据中学习多杀世界的第一种方法,而不知道底层混合参数。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号