首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >UNSUPERVISED DOMAIN ADAPTATION FOR GENDER-AWARE PLDA MIXTURE MODELS
【24h】

UNSUPERVISED DOMAIN ADAPTATION FOR GENDER-AWARE PLDA MIXTURE MODELS

机译:针对性别感知PLDA混合模型的无监督域适应

获取原文

摘要

Probabilistic linear discriminant analysis (PLDA) is a state-of-art back-end for i-vector based speaker verification. However, this back-end is still problematic when (1) the model is deployed to new environment (in-domain) that is very different from the training one (out-of-domain) and (2) there are insufficient labeled data from the new environment. To address these problems, this paper proposes using out-of-domain training data to pre-train a PLDA mixture model and applying the mixture model on the in-domain training data to compute a pairwise score matrix for spectral clustering. The hypothesized speaker labels produced by spectral clustering are then used for re-training the mixture model to fit the new environment. To refine the mixture model, the spectral clustering and re-training processes are repeated a number of times. To make the mixture model amenable to both genders, a deep neural network (DNN) is trained to produce gender posteriors given an i-vector. The gender posteriors then replace the posterior probabilities of the indicator variables in the PLDA mixture model. Evaluations based on NIST 2016 SRE suggest that at the end of the iterative re-training, the PLDA mixture model becomes fully adapted to the new domain. Results also show that the PLDA scores can be readily incorporated into spectral clustering, resulting in high quality speaker clusters that could not be possibly achieved by agglomerative hierarchical clustering.
机译:概率线性判别分析(PLDA)是基于I载体的扬声器验证的最先进的后端。然而,当(1)将模型部署到与训练(Out-Divel)的新环境(域内)部署到新的环境(域中)和(2)时,此后端仍然存在问题,其中有足够的标记数据新环境。为了解决这些问题,本文建议使用域外训练数据来预先列车,并将混合模型应用于域培训数据,以计算用于光谱聚类的成对得分矩阵。然后使用光谱聚类产生的假设扬声器标签来重新培训混合模型以适应新环境。为了优化混合模型,频谱聚类和重新训练过程重复多次。为了使混合物模型适用于两者的性别,训练深神经网络(DNN)以给予I形载体的性别后海前。然后,性别后续仪在PLDA混合物模型中替换指示器变量的后验概率。基于NIST 2016 SRE的评估表明,在迭代重新培训结束时,PLDA混合物模型完全适应新域。结果还表明,PLDA分数可以容易地结合到光谱聚类中,导致高质量的扬声器簇,其无法通过附聚层间聚类来实现。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号