首页> 外文会议>Audio Engineering Society convention >Vibrary: A Consumer-Trainable Music Tagging Utility
【24h】

Vibrary: A Consumer-Trainable Music Tagging Utility

机译:振动:消费者可培训的音乐标签实用程序

获取原文

摘要

We present the engineering underlying a consumer application to help music industry professionals find audio clips and samples of personal interest within their large audio libraries typically consisting of heterogeneously-labeled clips supplied by various vendors. We enable users to train an indexing system using their own custom tags (e.g., instruments, genres, moods), by means of convolutional neural networks operating on spectrograms. Since the intended users arc not data scientists and may not possess the required computational resources (i.e. Graphics Processing Units, GPUs), our primary contributions consist of ⅰ) designing an intuitive user experience for a local client application to help users create representative spectrogram datasets, and ⅱ) 'seamless' integration with a cloud-based GPU server for efficient neural network training.
机译:我们提供了一个消费者应用程序基础的工程,以帮助音乐行业专业人士在其大型音频库中找到音频剪辑和个人感兴趣的样本,这些音频库通常由各种供应商提供的带有异类标签的剪辑组成。我们使用户能够通过在谱图上运行的卷积神经网络,使用自己的自定义标签(例如乐器,体裁,情绪)训练索引系统。由于目标用户不是数据科学家,并且可能不具备所需的计算资源(即图形处理单元,GPU),因此我们的主要贡献包括ⅰ)为本地客户端应用程序设计直观的用户体验,以帮助用户创建代表性的频谱图数据集, ⅱ)与基于云的GPU服务器“无缝”集成,以进行有效的神经网络训练。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号