首页> 外文会议>Audio Engineering Society Convention >Vibrary: A Consumer-Trainable Music Tagging Utility
【24h】

Vibrary: A Consumer-Trainable Music Tagging Utility

机译:狂热:消费者可训练的音乐标记效用

获取原文

摘要

We present the engineering underlying a consumer application to help music industry professionals find audio clips and samples of personal interest within their large audio libraries typically consisting of heterogeneously-labeled clips supplied by various vendors. We enable users to train an indexing system using their own custom tags (e.g., instruments, genres, moods), by means of convolutional neural networks operating on spectrograms. Since the intended users are not data scientists and may not possess the required computational resources (i.e. Graphics Processing Units, GPUs), our primary contributions consist of i) designing an intuitive user experience for a local client application to help users create representative spectrogram datasets, and ii) 'seamless' integration with a cloud-based GPU server for efficient neural network training.
机译:我们介绍了消费者申请的基础,以帮助音乐行业专业人员在其大型音频库中找到音频剪辑和个人兴趣的样本,通常由各种供应商提供的异质标记的剪辑组成。我们使用户能够使用自己的自定义标签(例如,仪器,流派,情绪)培训索引系统,通过在谱图上运行的卷积神经网络。由于预期用户不是数据科学家,并且可能不具有所需的计算资源(即图形处理单元,GPU),我们的主要贡献包括i)为本地客户端应用程序设计直观的用户体验,以帮助用户创建代表频谱图数据集, II)与基于云的GPU服务器的“无缝”集成,以实现高效的神经网络培训。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号