...
首页> 外文期刊>Pattern recognition letters >Exploring kernel discriminant analysis for speaker verification with limited test data
【24h】

Exploring kernel discriminant analysis for speaker verification with limited test data

机译:探索内核判别分析以使用有限的测试数据进行说话人验证

获取原文
获取原文并翻译 | 示例
           

摘要

Speaker verification (SV) with limited test data condition is desirable for practical application oriented systems. The i-vector based speaker modeling has shown its significance for SV tasks, but its performance degrades as the utterance becomes shorter. The i-vectors apart from being compact and dominant speaker representations, bear channel and session information, which has to be compensated for robust speaker modeling. The conventional techniques for channel/ session compensation include linear discriminant analysis (LDA) followed by within class covariance normalization (WCCN) and Gaussian probabilistic linear discriminant analysis (GPLDA) that eliminate the channel/ session variation across the i-vectors by assuming these are linearly separable. In this work, a novel method for channel/ session compensation is proposed using kernel discriminant analysis (KDA) that projects the i-vectors into a higher dimensional space and performs discriminant analysis to remove the unwanted information for speaker modeling. The SV studies are performed on standard NIST speaker recognition evaluation (SRE) 2003 and 2008 databases that convey the significance of the proposed compensation over the conventional methods, which is greater on using short test utterances. The achieved improvements are hypothesized due to the non-linearities of channel/ session information in the i-vector domain. (C) 2017ElsevierB. V. Allrightsreserved.
机译:具有有限测试数据条件的说话者验证(SV)对于面向实际应用的系统是理想的。基于i向量的说话人建模已显示出其对SV任务的重要性,但随着说话时间的缩短,其性能会下降。 i向量除了紧凑而占主导地位的说话者表示外,还具有频道和会话信息,必须对这些向量进行补偿才能实现强大的说话人建模。通道/会话补偿的常规技术包括线性判别分析(LDA),然后进行类内协方差归一化(WCCN)和高斯概率线性判别分析(GPLDA),它们通过假设i向量是线性的来消除通道/会话变化。可分离。在这项工作中,提出了一种使用内核判别分析(KDA)的信道/会话补偿的新方法,该方法将i向量投影到更高维度的空间中,并执行判别分析以去除不需要的信息以进行说话人建模。 SV研究是在标准的NIST说话者识别评估(SRE)2003和2008数据库上进行的,这些数据库传达了建议的补偿方法对传统方法的重要性,这在使用简短测试发音时更为明显。假设由于i向量域中通道/会话信息的非线性而实现了改进。 (C)2017爱思唯尔B. V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号