Content-based auto-tagging of audios using deep learning

机译：基于内容的Audios自动标记使用深度学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In the recent years, deep learning and feature learning have drawn significant attention in the field of Music Information Retrieval (MIR) research, inspired by good results in speech recognition and computer vision. Here, we tackle the problem of content-based automatic tagging of audios which is a multi-label classification task. Deep neural network architectures like Convolutional Neural Network and Convolutional Recurrent Neural Network are used to learn hierarchical features from musical audio signals and the experiments are performed on MagnaTagATune (MTT) dataset. We focused to achieve state-of-the-art performance with Mel-spectrogram input. Tags such as genre, instruments, emotions etc. can be automatically predicted for newer tracks with the focus on accurate classification of clips. These tags convey high-level information from a listener's perspective and thus can be used for organization of music library, efficient music browsing, creating personalized recommendations, playlist generation, and other applications.

机译：在近年来，深入学习和特色学习在音乐信息检索（MIR）研究领域中造成了重大关注，其在语音识别和计算机视觉中的良好结果启发。在这里，我们解决了基于内容的Audios自动标记的问题，这是一个多标签分类任务。卷积神经网络和卷积复发性神经网络等深度神经网络架构用于学习来自音频信号的分层特征，并且在MagnaTagatune（MTT）数据集上执行实验。我们专注于通过熔融谱图输入实现最先进的性能。可以自动预测诸如类型，仪器，情绪等的标签，以便较新的曲目专注于准确分类剪辑。这些标签从侦听器的角度传达高级信息，因此可以用于音乐库的组织，高效的音乐浏览，创建个性化的推荐，播放列表和其他应用程序。

著录项

来源
《International Conference on Big Data, IoT and Data Science》|2017年|198p|共7页
会议地点
作者
Rashmeet Kaur Nayyar; Sushmita Nair; Omkar Patil; Rasika Pawar; Amruta Lolage;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词
Feature extraction; Computer architecture; Biological neural networks; Convolution; Training; Recurrent neural networks; Convolutional neural networks;

机译：特征提取;计算机架构;生物神经网络;卷积;培训;经常性神经网络;卷积神经网络;

相似文献

外文文献
中文文献
专利

1. Similarity-preserving hash for content-based audio retrieval using unsupervised deep neural networks [J] . Petcharat Panyapanuwat, Suwatchai Kamonsantiroj, Luepol Pipanmaekaporn International Journal of Electrical and Computer Engineering . 2021,第1期

机译：基于内容的音频检索的相似性保存哈希使用无监督的深神经网络
2. Visual content-based web page categorization with deep transfer learning and metric learning [J] . Lopez-Sanchez Daniel, Gonzalez Arrieta Angelica, Corchado Juan M. Neurocomputing . 2019,第APRa21期

机译：基于视觉内容的网页分类，包括深度迁移学习和度量学习
3. Visual content-based web page categorization with deep transfer learning and metric learning [J] . Lopez-Sanchez Daniel, Gonzalez Arrieta Angelica, Corchado Juan M. Neurocomputing . 2019,第Apra21期

机译：基于视觉内容的网页分类，深度传输学习和度量学习
4. Content-based auto-tagging of audios using deep learning [C] . Rashmeet Kaur Nayyar, Sushmita Nair, Omkar Patil, 2017 International Conference on Big Data, IoT and Data Science . 2017

机译：使用深度学习对音频进行基于内容的自动标记
5. Content-Based Image Retrieval using Deep Learning. [D] . Singh, Anshuman Vikram. 2015

机译：使用深度学习的基于内容的图像检索。
6. OtoMatch: Content-based eardrum image retrieval using deep learning [O] . Seda Camalan, Muhammad Khalid Khan Niazi, Aaron C. Moberly, 2020

机译：Otomatch：基于内容的耳膜图像检索使用深度学习
7. Similarity-preserving hash for content-based audio retrieval using unsupervised deep neural networks [O] . Petcharat Panyapanuwat, Suwatchai Kamonsantiroj, Luepol Pipanmaekaporn 2021

机译：基于内容的音频检索使用无监督的深神经网络的相似性 - 保留哈希

Content-based auto-tagging of audios using deep learning

摘要

著录项

相似文献

相关主题

期刊订阅