Audio Tagging With Connectionist Temporal Classification Model Using Sequentially Labelled Data

Abstract

Audio tagging aims to predict one or several labels in an audio clip. Many previous works use weakly labelled data (WLD) for audio tagging, where only the presence or absence of sound events is known and their order is not. To use the order information of sound events, we propose sequentially labelled data (SLD), where both the presence or absence and the order of sound events are known. To utilize SLD in audio tagging, we propose a convolutional recurrent neural network followed by a connectionist temporal classification objective function (CRNN-CTC) to map from an audio clip spectrogram to SLD. Experiments show that CRNN-CTC obtains an area under the curve (AUC) score of 0.986 in audio tagging, outperforming the baseline CRNN, which scores 0.908 with max pooling and 0.815 with average pooling. In addition, we show that CRNN-CTC can predict the order of sound events in an audio clip.
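The abstract contrasts two ways of turning frame-level network outputs into clip-level results: pooling over time (the baseline CRNN) and CTC decoding (CRNN-CTC). The sketch below illustrates both on toy numbers. It is not the authors' implementation: the function names, the blank index of 0, the use of greedy best-path decoding, and the example probabilities are all assumptions made for illustration.

```python
from typing import List, Sequence

BLANK = 0  # assumed index of the CTC blank symbol (not stated in the abstract)


def pool_clip_scores(frame_probs: Sequence[Sequence[float]], mode: str = "max") -> List[float]:
    """Aggregate per-frame event probabilities into one score per event.

    This mirrors the max/average pooling baselines in spirit only; the
    order of events is lost in either mode.
    """
    n_classes = len(frame_probs[0])
    columns = [[frame[c] for frame in frame_probs] for c in range(n_classes)]
    if mode == "max":
        return [max(col) for col in columns]
    if mode == "average":
        return [sum(col) / len(col) for col in columns]
    raise ValueError(f"unknown pooling mode: {mode}")


def ctc_greedy_decode(frame_probs: Sequence[Sequence[float]], blank: int = BLANK) -> List[int]:
    """Greedy (best-path) CTC decoding.

    Take the most probable symbol in each frame, merge consecutive
    repeats, then drop blanks. The result is an ordered label sequence.
    """
    best_path = [max(range(len(p)), key=p.__getitem__) for p in frame_probs]
    decoded: List[int] = []
    previous = None
    for label in best_path:
        if label != previous and label != blank:
            decoded.append(label)
        previous = label
    return decoded


# Hypothetical 5-frame output over (blank, event 1, event 2).
ctc_frames = [
    [0.80, 0.10, 0.10],
    [0.10, 0.70, 0.20],
    [0.20, 0.60, 0.20],
    [0.90, 0.05, 0.05],
    [0.10, 0.20, 0.70],
]
event_order = ctc_greedy_decode(ctc_frames)  # ordered event sequence
clip_tags = sorted(set(event_order))         # tags implied by that sequence

# Hypothetical 2-frame, 2-event output for the pooling baselines.
baseline_frames = [[0.5, 0.0], [1.0, 0.5]]
max_scores = pool_clip_scores(baseline_frames, "max")
avg_scores = pool_clip_scores(baseline_frames, "average")
```

Here `event_order` keeps the order in which events occur, which is the extra information SLD provides over WLD, while the pooled scores only say how strongly each event is present in the clip.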

Bibliographic record

Source
International conference on communications, signal processing, and systems | 2020 | xviii p. 853-1462 | 10 pages
Authors
Yuanbo Hou; Qiuqiang Kong; Shengchen Li
Original format
PDF
Chinese Library Classification
Information processing
Keywords
Audio tagging; Sequentially labelled data (SLD); Convolutional recurrent neural network (CRNN); Connectionist temporal classification (CTC)

Similar literature

1. Connectionist Temporal Classification Model for Dynamic Hand Gesture Recognition using RGB and Optical flow Data [J]. Patel Sunil, Makwana Ramji. The international arab journal of information technology. 2020, No. 4
2. Long short-term memory recurrent neural network-based acoustic model using connectionist temporal classification on a large-scale training corpus [J]. Donghyun Lee, Minkyu Lim, Hosung Park, Communications, China. 2017, No. 9
3. Spatio-temporal data classification through multidimensional sequential patterns: Application to crop mapping in complex landscape [J]. Yoann Pitarch, Dino Ienco, Elodie Vintrou, Engineering Applications of Artificial Intelligence. 2015, No. jana
4. Audio Tagging With Connectionist Temporal Classification Model Using Sequentially Labelled Data [C]. Yuanbo Hou, Qiuqiang Kong, Shengchen Li. International conference on communications, signal processing, and systems. 2020
5. Models of EEG data mining and classification in temporal lobe epilepsy: Wavelet-chaos-neural network methodology and spiking neural networks [D]. Ghosh Dastidar, Samanwoy. 2007
6. Deep learning models for bacteria taxonomic classification of metagenomic data [O]. Antonino Fiannaca, Laura La Paglia, Massimo La Rosa, 2018
7. Sound Event Detection with Sequentially Labelled Data Based on Connectionist Temporal Classification and Unsupervised Clustering [O]. Yuanbo Hou, Qiuqiang Kong, Shengchen Li, 2019