首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >The Idlab Voxsrc-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification
【24h】

The Idlab Voxsrc-20 Submission: Large Margin Fine-Tuning and Quality-Aware Score Calibration in DNN Based Speaker Verification

机译:IDLAB VOXSRC-20提交:基于DNN的扬声器验证中的大型边缘微调和质量感知分数校准

获取原文

摘要

In this paper we propose and analyse a large margin fine-tuning strategy and a quality-aware score calibration in text-independent speaker verification. Large margin fine-tuning is a secondary training stage for DNN based speaker verification systems trained with margin-based loss functions. It enables the network to create more robust speaker embeddings by enabling the use of longer training utterances in combination with a more aggressive margin penalty. Score calibration is a common practice in speaker verification systems to map output scores to well-calibrated log-likelihood-ratios, which can be converted to interpretable probabilities. By including quality features in the calibration system, the decision thresholds of the evaluation metrics become quality-dependent and more consistent across varying trial conditions. Applying both enhancements on the ECAPA-TDNN architecture leads to state-of-the-art results on all publicly available VoxCeleb1 test sets and contributed to our winning submissions in the supervised verification tracks of the Vox-Celeb Speaker Recognition Challenge 2020.
机译:在本文中,我们提出并分析了较为独立的扬声器验证中的大型保证金微调策略和质量意识的评分校准。大型边缘微调是一种基于DNN的扬声器验证系统的二级训练阶段,培训了基于边缘的损耗功能。它使网络能够通过使使用更长的培训话语与更积极的保证金惩罚组合使用更长的培训话语来创建更强大的扬声器嵌入式。评分校准是扬声器验证系统中的常见做法,以将输出分数映射到良好的校准概率 - 比率,这可以转换为可解释的概率。通过包括校准系统中的质量特征,评估度量的判定阈值变为不同的试验条件的质量依赖性和更一致。在ECAPA-TDNN架构上应用增强功能导致所有公开的VoxceB1测试集的最先进的结果,并为我们在Vox-Celeb扬声器识别挑战2020的监督验证轨道中提供了我们的获奖。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号