IEEE International Conference on Acoustics, Speech and Signal Processing
Improved music feature learning with deep neural networks

Abstract

Recent advances in neural network training provide a way to efficiently learn representations from raw data. Good representations are an important requirement for Music Information Retrieval (MIR) tasks to be performed successfully. However, a major problem with neural networks is that training time becomes prohibitive for very large datasets and the learning algorithm can get stuck in local minima for very deep and wide network architectures. In this paper we examine 3 ways to improve feature learning for audio data using neural networks: 1. using Rectified Linear Units (ReLUs) instead of standard sigmoid units; 2. using a powerful regularisation technique called Dropout; 3. using Hessian-Free (HF) optimisation to improve training of sigmoid nets. We show that these methods provide significant improvements in training time and the features learnt are better than state-of-the-art handcrafted features, with a genre classification accuracy of 83 ± 1.1% on the Tzanetakis (GTZAN) dataset. We found that the rectifier networks learnt better features than the sigmoid networks. We also demonstrate the capacity of the features to capture relevant information from audio data by applying them to genre classification on the ISMIR 2004 dataset.
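A minimal NumPy sketch (illustrative only, not the authors' implementation) of the first two techniques the abstract compares: ReLU versus sigmoid hidden units, and inverted Dropout applied to a hidden layer during training. The layer sizes, dropout rate, and function names are assumptions chosen for clarity.

```python
import numpy as np

def sigmoid(x):
    """Standard logistic sigmoid unit."""
    return 1.0 / (1.0 + np.exp(-x))

def relu(x):
    """Rectified Linear Unit: max(0, x)."""
    return np.maximum(0.0, x)

def dropout(h, rate=0.5, rng=None, training=True):
    """Inverted dropout: zero each unit with probability `rate`
    and rescale survivors by 1/(1-rate), so the expected
    activation is unchanged at test time."""
    if not training or rate == 0.0:
        return h
    rng = rng if rng is not None else np.random.default_rng(0)
    mask = rng.random(h.shape) >= rate
    return h * mask / (1.0 - rate)

# Forward pass through one hidden layer with each activation.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 8))          # batch of 4 feature vectors
W = rng.standard_normal((8, 16)) * 0.1   # hidden-layer weights
h_relu = relu(x @ W)                     # rectifier hidden layer
h_sig  = sigmoid(x @ W)                  # sigmoid hidden layer
h_drop = dropout(h_relu, rate=0.5, rng=rng)  # regularised training pass
```

Note that ReLU activations are exactly zero for negative pre-activations, which gives sparse hidden representations and avoids the vanishing gradients that make deep sigmoid nets hard to train (the problem HF optimisation addresses for the sigmoid case).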
