首页> 外文会议>International Conference on Big Data, IoT and Data Science >Content-based auto-tagging of audios using deep learning
【24h】

Content-based auto-tagging of audios using deep learning

机译:基于内容的Audios自动标记使用深度学习

获取原文

摘要

In the recent years, deep learning and feature learning have drawn significant attention in the field of Music Information Retrieval (MIR) research, inspired by good results in speech recognition and computer vision. Here, we tackle the problem of content-based automatic tagging of audios which is a multi-label classification task. Deep neural network architectures like Convolutional Neural Network and Convolutional Recurrent Neural Network are used to learn hierarchical features from musical audio signals and the experiments are performed on MagnaTagATune (MTT) dataset. We focused to achieve state-of-the-art performance with Mel-spectrogram input. Tags such as genre, instruments, emotions etc. can be automatically predicted for newer tracks with the focus on accurate classification of clips. These tags convey high-level information from a listener's perspective and thus can be used for organization of music library, efficient music browsing, creating personalized recommendations, playlist generation, and other applications.
机译:在近年来,深入学习和特色学习在音乐信息检索(MIR)研究领域中造成了重大关注,其在语音识别和计算机视觉中的良好结果启发。在这里,我们解决了基于内容的Audios自动标记的问题,这是一个多标签分类任务。卷积神经网络和卷积复发性神经网络等深度神经网络架构用于学习来自音频信号的分层特征,并且在MagnaTagatune(MTT)数据集上执行实验。我们专注于通过熔融谱图输入实现最先进的性能。可以自动预测诸如类型,仪器,情绪等的标签,以便较新的曲目专注于准确分类剪辑。这些标签从侦听器的角度传达高级信息,因此可以用于音乐库的组织,高效的音乐浏览,创建个性化的推荐,播放列表和其他应用程序。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号