...
首页> 外文期刊>International journal of speech technology >A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems
【24h】

A study on the effects of using short utterance length development data in the design of GPLDA speaker verification systems

机译:在GPLDA说话人验证系统设计中使用短发声长度发展数据的效果研究

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper studies the performance degradation of Gaussian probabilistic linear discriminant analysis (GPLDA) speaker verification system, when only short-utterance data is used for speaker verification system development. Subsequently, a number of techniques, including utterance partitioning and source-normalised weighted linear discriminant analysis (SN-WLDA) projections are introduced to improve the speaker verification performance in such conditions. Experimental studies have found that when short utterance data is available for speaker verification development, GPLDA system overall achieves best performance with a lower number of universal background model (UBM) components. As a lower number of UBM components significantly reduces the computational complexity of speaker verification system, that is a useful observation. In limited session data conditions, we propose a simple utterance-partitioning technique, which when applied to the LDA-projected GPLDA system shows over 8% relative improvement on EER values over baseline system on NIST 2008 truncated 10-10 s conditions. We conjecture that this improvement arises from the apparent increase in the number of sessions arising from our partitioning technique and this helps to better model the GPLDA parameters. Further, partitioning SN-WLDA-pro-jected GPLDA shows over 16% and 6% relative improvement on EER values over LDA-projected GPLDA systems respectively on NIST 2008 truncated 10-10 s interview-interview, and NIST 2010 truncated 10-10 s interview-interview and telephone-telephone conditions.
机译:当仅短话数据用于说话人验证系统开发时,本文研究了高斯概率线性判别分析(GPLDA)说话人验证系统的性能下降。随后,引入了多种技术,包括话语划分和源归一化加权线性判别分析(SN-WLDA)投影,以改善这种情况下的说话者验证性能。实验研究发现,当简短的说话数据可用于说话者验证开发时,GPLDA系统总体上以较少的通用背景模型(UBM)组件实现了最佳性能。由于较少数量的UBM组件显着降低了说话人验证系统的计算复杂度,因此非常有用。在有限的会话数据条件下,我们提出了一种简单的发声分割技术,该技术应用于LDA投影的GPLDA系统时,在10-10 s的NIST 2008条件下,相对于基线系统,EER值相对提高了8%以上。我们推测,这种改进是由于我们的分区技术引起的会话数的明显增加而引起的,这有助于更好地建模GPLDA参数。此外,对SN-WLDA预测的GPLDA进行分区显示,相对于LDA预测的GPLDA系统,分别在NIST 2008截断访谈10-10 s和NIST 2010截断访谈10-10 s时,EER值相对提高了16%和6%。采访-采访和电话-电话条件。

著录项

  • 来源
    《International journal of speech technology》 |2017年第2期|247-259|共13页
  • 作者单位

    Department of Electrical & Electronic Engineering, Faculty of Engineering, University of Jaffna, Kilinochchi, Sri Lanka,Speech Research Lab, SAIVT, Queensland University of Technology, Brisbane, QLD, Australia;

    Speech Research Lab, SAIVT, Queensland University of Technology, Brisbane, QLD, Australia;

    Speech Research Lab, SAIVT, Queensland University of Technology, Brisbane, QLD, Australia;

    Speech Research Lab, SAIVT, Queensland University of Technology, Brisbane, QLD, Australia;

    Speech Research Lab, SAIVT, Queensland University of Technology, Brisbane, QLD, Australia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号