首页> 外文期刊>Computer speech and language >I-vector based speaker recognition using advanced channel compensation techniques
【24h】

I-vector based speaker recognition using advanced channel compensation techniques

机译:使用高级通道补偿技术的基于I矢量的说话人识别

获取原文
获取原文并翻译 | 示例
       

摘要

This paper investigates advanced channel compensation techniques for the purpose of improving i-vector speaker verification performance in the presence of high intersession variability using the NIST 2008 and 2010 SRE corpora. The performance of four channel compensation techniques: (a) weighted maximum margin criterion (WMMC), (b) source-normalized WMMC (SN-WMMC), (c) weighted linear discriminant analysis (WLDA) and (d) source-normalized WLDA (SN-WLDA) have been investigated. We show that, by extracting the discriminatory information between pairs of speakers as well as capturing the source variation information in the development i-vector space, the SN-WLDA based cosine similarity scoring (CSS) i-vector system is shown to provide over 20% improvement in EER for NIST 2008 interview and microphone verification and over 10% improvement in EER for NIST 2008 telephone verification, when compared to SN-LDA based CSS i-vector system. Further, score-level fusion techniques are analyzed to combine the best channel compensation approaches, to provide over 8% improvement in DCF over the best single approach, SN-WLDA, for NIST 2008 interview/telephone enrolment-verification condition. Finally, we demonstrate that the improvements found in the context of CSS also generalize to state-of-the-art GPLDA with up to 14% relative improvement in EER for NIST SRE 2010 interview and microphone verification and over 7% relative improvement in EER for NIST SRE 2010 telephone verification.
机译:本文研究了先进的信道补偿技术,目的是在使用NIST 2008和2010 SRE语料库的高会话间可变性的情况下提高i-vector说话者验证性能。四种信道补偿技术的性能:(a)加权最大余量标准(WMMC),(b)源归一化WMMC(SN-WMMC),(c)加权线性判别分析(WLDA)和(d)源归一化WLDA (SN-WLDA)已被调查。我们表明,通过提取说话者对之间的歧视性信息以及捕获开发i-vector空间中的源变化信息,基于SN-WLDA的余弦相似性评分(CSS)i-vector系统可提供20多个与基于SN-LDA的CSS i-vector系统相比,NIST 2008采访和麦克风验证的EER提高了10%,NIST 2008电话验证的EER提高了10%以上。此外,针对NIST 2008采访/电话入学验证条件,分析了分数级融合技术以结合最佳的信道补偿方法,以使DCF相对于最佳的单一方法SN-WLDA改善8%以上。最后,我们证明,在CSS上下文中发现的改进也可以推广到最新的GPLDA,NIST SRE 2010采访和麦克风验证的EER相对改进高达14%,而EIST的EER相对改进则高达7%。 NIST SRE 2010电话验证。

著录项

  • 来源
    《Computer speech and language》 |2014年第1期|121-140|共20页
  • 作者单位

    Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

    Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

    Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

    Centre for Language and Speech Technology, Radboud University Nijmegen, The Netherlands;

    Speech and Audio Research Lab, SAIVT, Queensland University of Technology, Australia;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Speaker verification; I-vector; GPLDA; LDA; SN-LDA; WLDA; SN-WLDA;

    机译:说话者验证;载体GPLDA;LDA;SN-LDA;WLDA;SN-WLDA;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号