Weakly-supervised audio event detection using event-specific Gaussian filters and fully convolutional networks

机译：使用事件特定的高斯滤波器和完全卷积网络进行弱监督的音频事件检测

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Audio event detection aims at discovering the elements inside an audio clip. In addition to labeling the clips with the audio events, we want to find out the temporal locations of these events. However, creating clearly annotated training data can be time-consuming. Therefore, we provide a model based on convolutional neural networks that relies only on weakly-supervised data for training. These data can be directly obtained from online platforms, such as Freesound, with the clip-level labels assigned by the uploaders. The structure of our model is extended to a fully convolutional networks, and an event-specific Gaussian filter layer is designed to advance its learning ability. Besides, this model is able to detect frame-level information, e.g., the temporal position of sounds, even when it is trained merely with clip-level labels.

机译：音频事件检测旨在发现音频剪辑中的元素。除了用音频事件标记剪辑外，我们还想找出这些事件的时间位置。但是，创建带有注释的训练数据非常耗时。因此，我们提供了基于卷积神经网络的模型，该模型仅依赖于弱监督数据进行训练。这些数据可以直接从在线平台（例如Freesound）获取，并具有上传者分配的剪辑级别标签。我们模型的结构扩展到一个完全卷积的网络，并设计了一个特定于事件的高斯滤波器层来提高其学习能力。此外，即使仅使用剪辑级标签训练该模型，该模型也能够检测帧级信息，例如声音的时间位置。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2017年|791-795|共5页
会议地点
作者
Ting-Wei Su; Jen-Yu Liu; Yi-Hsuan Yang;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Data models; Training; Predictive models; Convolution; Training data; Event detection; Neural networks;

机译：数据模型;训练;预测模型;卷积;训练数据;事件检测;神经网络;

相似文献

外文文献
中文文献
专利

1. Weakly-supervised learning for community detection based on graph convolution in attributed networks [J] . Wang Xiaofeng, Li Jianhua, Yang Li, International journal of machine learning and cybernetics . 2021,第12期

机译：基于Graph卷积的社区检测学习弱监督
2. Convolutional recurrent neural networks with multi-sized convolution filters for sound-event recognition [J] . Huang Feizhen, Zeng Jinfang, Zhang Yu, Modern Physics Letters, B. Condensed Matter Physics, Statistical Physics, Applied Physics . 2020,第23期

机译：具有多尺寸卷积滤波器的卷积经常性神经网络，用于声音事件识别
3. Audio style transfer using shallow convolutional networks and random filters [J] . Jiyou Chen, Gaobo Yang, Huihuang Zhao, Multimedia Tools and Applications . 2020,第21a22期

机译：使用浅卷积网络和随机滤波器进行音频风格转移
4. Weakly-supervised audio event detection using event-specific Gaussian filters and fully convolutional networks [C] . Ting-Wei Su, Jen-Yu Liu, Yi-Hsuan Yang IEEE International Conference on Acoustics, Speech and Signal Processing . 2017

机译：使用特定事件的高斯滤波器和完全卷积网络的弱监督音频事件检测
5. Finding Event-Specific Influencers in Dynamic Social Networks . [D] . Schenk, Christopher Brendan. 2010

机译：在动态社交网络中查找特定于事件的影响者。
6. Ear Detection Using Convolutional Neural Network on Graphs with Filter Rotation [O] . Arkadiusz Tomczyk, Piotr S. Szczepaniak 2019

机译：用滤波器旋转的卷积神经网络使用卷积神经网络的耳朵检测
7. R-CRNN: Region-based Convolutional Recurrent Neural Network for Audio Event Detection [O] . Chieh-Chi Kao, Weiran Wang, Ming Sun, 2018

机译：R-CRNN：基于区域的卷积复制神经网络，用于音频事件检测

Weakly-supervised audio event detection using event-specific Gaussian filters and fully convolutional networks

摘要

著录项

相似文献

相关主题

期刊订阅