Automatic Multitrack Mixing With A Differentiable Mixing Console Of Neural Audio Effects

Abstract

Applications of deep learning to automatic multitrack mixing are largely unexplored. This is partly due to the limited available data, coupled with the fact that such data is relatively unstructured and variable. To address these challenges, we propose a domain-inspired model with a strong inductive bias for the mixing task. We achieve this with the application of pre-trained sub-networks and weight sharing, as well as with a sum/difference stereo loss function. The proposed model can be trained with a limited number of examples, is permutation invariant with respect to the input ordering, and places no limit on the number of input sources. Furthermore, it produces human-readable mixing parameters, allowing users to manually adjust or refine the generated mix. Results from a perceptual evaluation involving audio engineers indicate that our approach generates mixes that outperform baseline approaches. To the best of our knowledge, this work demonstrates the first approach in learning multitrack mixing conventions from real-world data at the waveform level, without knowledge of the underlying mixing parameters.
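The abstract names a sum/difference stereo loss but does not define it. As a rough illustration only, the sketch below compares the sum (L + R) and difference (L - R) signals of a predicted and a target stereo mix. The function names `l1_distance` and `sum_diff_loss` are hypothetical, and the time-domain mean absolute error is an assumption chosen for brevity; the distance actually used by the authors is not stated here.

```python
from typing import Sequence


def l1_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Mean absolute error between two equal-length signals (assumed distance)."""
    return sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def sum_diff_loss(
    pred_left: Sequence[float],
    pred_right: Sequence[float],
    target_left: Sequence[float],
    target_right: Sequence[float],
) -> float:
    """Hypothetical sum/difference stereo loss.

    Both mixes are converted from left/right to sum/difference form,
    and the distance is accumulated over the two resulting signals.
    """
    pred_sum = [l + r for l, r in zip(pred_left, pred_right)]
    pred_diff = [l - r for l, r in zip(pred_left, pred_right)]
    target_sum = [l + r for l, r in zip(target_left, target_right)]
    target_diff = [l - r for l, r in zip(target_left, target_right)]
    return l1_distance(pred_sum, target_sum) + l1_distance(pred_diff, target_diff)


# A wide stereo target, and a prediction that collapses it to mono.
target_l, target_r = [1.0, 0.0], [0.0, 1.0]
mono_l, mono_r = [0.5, 0.5], [0.5, 0.5]

perfect = sum_diff_loss(target_l, target_r, target_l, target_r)
collapsed = sum_diff_loss(mono_l, mono_r, target_l, target_r)
```

In this toy case the mono prediction matches the target's sum signal exactly, so the whole penalty comes from the difference term, which is where stereo width lives.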

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2021 | pp. 71-75 | 5 pages
Authors
Christian J. Steinmetz; Jordi Pons; Santiago Pascual; Joan Serrà
Original format
PDF
Keywords
Deep learning; Convolution; Conferences; Music; Multiple signal classification; Task analysis; Speech processing;

Similar literature

1. Software-Defined Radio for Modular Audio Mixers: Making Use of Market-Available Audio Consoles and Software-Defined Radio to Build Multiparty Audio-Mixing Systems [J]. Samer Jaloudi. Consumer Electronics Magazine, IEEE. 2017, No. 4
2. Populating the Mix Space: Parametric Methods for Generating Multitrack Audio Mixtures [J]. Bruno M. Fazenda, Alex Wilson. Applied Sciences. 2017, No. 12
3. An automatic mixing system for multitrack spatialization for stereo based on unmasking and best panning practices [C]. Ajin Tom, Joshua Reiss, Philippe Depalle. Audio Engineering Society international convention. 2019
4. Ai Driven Multitrack Audio Mixing Using Fuzzy Logic Tools [D]. Fermin, Luis Fernando. 2020
5. Differential effects of Th1 monocyte/macrophage and Th2 cytokine mixtures on early gene expression for glial and neural-related molecules in central nervous system mixed glial cell cultures: neurotrophins growth factors and structural proteins [O]. Robert P Lisak, Joyce A Benjamins, Beverly Bealmear. 2007
6. Populating the mix space: parametric methods for generating multitrack audio mixtures [O]. Wilson, AD, Fazenda, BM. 2017
