Sample-Level CNN Architectures for Music Auto-Tagging Using Raw Waveforms

Abstract

Recent work has shown that the end-to-end approach using convolutional neural networks (CNNs) is effective in various types of machine learning tasks. For audio signals, the approach takes raw waveforms as input using a 1-D convolution layer. In this paper, we improve the 1-D CNN architecture for music auto-tagging by adopting building blocks from state-of-the-art image classification models, ResNets and SENets, and adding multi-level feature aggregation to it. We compare different combinations of the modules in building CNN architectures. The results show that they achieve significant improvements over previous state-of-the-art models on the MagnaTagATune dataset and comparable results on the Million Song Dataset. Furthermore, we analyze and visualize our model to show how the 1-D CNN operates.
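The building blocks named in the abstract can be illustrated with a minimal pure-Python sketch. It is not the authors' implementation: the convolution layers are omitted, the weight matrices `w_reduce` and `w_expand` are hypothetical and bias-free, and the function names (`se_scale`, `res_se_block`, `aggregate`) are invented for illustration. The use of global max pooling in the aggregation step is also an assumption. The sketch only shows the general ideas of squeeze-and-excitation (SENet-style channel gating), a residual connection (ResNet-style), and concatenating pooled features from several levels.

```python
import math


def se_scale(features, w_reduce, w_expand):
    """Squeeze-and-excitation over a 1-D feature map.

    features: list of C channels, each a list of T floats.
    w_reduce: R x C weight matrix (hypothetical, no bias for brevity).
    w_expand: C x R weight matrix (hypothetical, no bias for brevity).
    """
    # Squeeze: global average pooling over time, one value per channel.
    squeezed = [sum(ch) / len(ch) for ch in features]
    # Excitation: bottleneck layer with ReLU, then a layer with sigmoid,
    # producing one gate in (0, 1) per channel.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed)))
              for row in w_reduce]
    gates = [1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
             for row in w_expand]
    # Rescale every channel by its gate.
    return [[g * x for x in ch] for g, ch in zip(gates, features)]


def res_se_block(features, w_reduce, w_expand):
    """Add the block input back to the SE-rescaled features (skip connection)."""
    scaled = se_scale(features, w_reduce, w_expand)
    return [[x + y for x, y in zip(ch_in, ch_out)]
            for ch_in, ch_out in zip(features, scaled)]


def aggregate(levels):
    """Multi-level aggregation: max-pool each channel over time at every
    level, then concatenate the pooled values into one feature vector."""
    return [max(ch) for level in levels for ch in level]


# Toy input: 2 channels, 2 time steps. All-zero weights make every gate 0.5.
feats = [[1.0, 3.0], [2.0, 2.0]]
out = res_se_block(feats, [[0.0, 0.0]], [[0.0], [0.0]])
print(out)                      # [[1.5, 4.5], [3.0, 3.0]]
print(aggregate([feats, out]))  # [3.0, 2.0, 4.5, 3.0]
```

In a real model the gates would be learned, and each block would apply 1-D convolutions to the waveform-derived features before the gating step.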

Bibliographic details

Source: IEEE International Conference on Acoustics, Speech and Signal Processing | 2018 | 605p | 5 pages
Authors: Taejun Kim; Jongpil Lee; Juhan Nam
Original format: PDF
Chinese Library Classification: TN912-53
Keywords: convolutional neural networks; music auto-tagging; raw waveforms; multi-level learning
