首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >An Improved Mean Teacher Based Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection
【24h】

An Improved Mean Teacher Based Method for Large Scale Weakly Labeled Semi-Supervised Sound Event Detection

机译:一种改进的大规模弱标记半监控声音事件检测方法

获取原文

摘要

This paper presents an improved mean teacher (MT) based method for large-scale weakly labeled semi-supervised sound event detection (SED), by focusing on learning a better student model. Two main improvements are proposed based on the authors’ previous perturbation based MT method. Firstly, an event-aware module is de-signed to allow multiple branches with different kernel sizes to be fused via an attention mechanism. By inserting this module after the convolutional layer, each neuron can adaptively adjust its receptive field to suit different sound events. Secondly, instead of using the teacher model to provide a consistency cost term, we propose using a stochastic inference of unlabeled examples to generate high quality pseudo-targets by averaging multiple predictions from the perturbed student model. MixUp of both labeled and unlabeled data is further exploited to improve the effectiveness of student model. Finally, the teacher model can be obtained via exponential moving average (EMA) of the student model, which generates final predictions for SED during inference. Experiments on the DCASE2018 task4 dataset demonstrate the ability of the proposed method. Specifically, an F1-score of 42.1% is achieved, significantly outperforming the 32.4% achieved by the winning system, or the 39.3% by the previous perturbation based method.
机译:本文通过专注于学习更好的学生模型,提出了一种改进的大规模弱标记半监控声音事件检测(SED)的方法。基于作者之前的基于扰动的MT方法提出了两个主要改进。首先,将事件感知模块解除签名以允许具有不同内核大小的多个分支通过注意机制融合。通过在卷积层之后插入该模块,每个神经元可以自适应地调整其接收领域以适应不同的声音事件。其次,代替使用教师模型提供一致性成本术语,我们建议使用未标记示例的随机推断来通过平均来自扰动的学生模型的多个预测来生成高质量的伪目标。进一步利用标记和未标记数据的混合来提高学生模型的有效性。最后,可以通过学生模型的指数移动平均(EMA)获得教师模型,其在推理期间为SED产生最终预测。 DCEST2018 Task4数据集上的实验证明了所提出的方法的能力。具体而言,实现了42.1%的F1分数,显着优于获胜系统所实现的32.4%,或通过先前的基于扰动的方法实现的39.3%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号