首页> 外文会议>International Conference on Communication, Image and Signal Processing >Classification of the Mask Augsburg Speech Corpus (MASC) Using the Consistency Learning Method
【24h】

Classification of the Mask Augsburg Speech Corpus (MASC) Using the Consistency Learning Method

机译:使用一致学习方法对奥格斯堡语音语料库(MASC)进行分类

获取原文

摘要

This paper presents the details of our solution for the mask sub-task of the INTERSPEECH 2020 Computational Para-linguistics Challenge (ComParE). The speech production can be significantly affected when the speaker wears a face mask. The task evaluates the systems for the classification of speech recordings with and without a surgical mask. A student-teacher deep-learning neural network is proposed inspired by the wellperformed consistency learning method on a lot of classification problems. In particular, the consistency regularization term is designed between out-puts of the student model and the guided teacher model. Different level Gaussian noises are respectively added into the inputs of the teacher model as model perturbations to optimize the system robustness. To take further advantage of the consistency learning, a small number of unlabeled evaluation data is utilized to be combined with the labeled data for the system training in a semi-supervised learning manner. Finally, the proposed system achieves an unweighted average recall up to 72.50% on the official evaluation dataset, increasing by 10% compared with the baseline result of 62.60%.
机译:本文介绍了我们对屏蔽的掩模子任务的解决方案的细节2020计算Para-Linguics挑战(比较)。当扬声器佩戴面罩时,语音生产可能会受到显着影响。该任务评估了具有手术面具的语音录制的分类系统。学生教师深学习神经网络是由Wellimerformed一致性学习方法提出的,在大量分类问题上启发。特别是,一致性正则化术语是在学生模型和引导教师模型的推出之间设计的。不同的级别高斯噪声分别添加到教师模型的输入中作为模型扰动,以优化系统的鲁棒性。为了采取一致性学习的进一步优势,利用少量未标记的评估数据以半监督学习方式与系统培训的标记数据组合。最后,拟议的系统在官方评估数据集中实现了高达72.50%的未加权平均召回,与32.60%的基线结果相比增长了10%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号