首页> 外文期刊>Computers and Electrical Engineering >Investigation of the effect of data duration and speaker gender on text-independent speaker recognition
【24h】

Investigation of the effect of data duration and speaker gender on text-independent speaker recognition

机译:研究数据持续时间和说话人性别对与文本无关的说话人识别的影响

获取原文
获取原文并翻译 | 示例
       

摘要

Duration of training/test data has a considerable effect on the performance of a speaker recognition system. In this paper, we analyze the effect of training and test data duration and speaker gender on the performance of speaker recognition systems. Gaussian mixture models-universal background model (GMM-UBM), vector quantization-universal background model (VQ-UBM), support vector machines-generalized linear discriminant sequence kernel (SVM-GLDS) and support vector machines with GMM supervectors (GSV-SVM) are the classifiers we use for speaker recognition. Experimental results conducted on NIST 2002 and NIST 2005 speaker recognition evaluation (SRE) databases show that recognition performance breaks down when short utterances are used for training and testing independent from the recognizer (e.g. equal error rate (EER) reduces from 10.33% to 27.86% on NIST 2005) and GSV-SVM system yields higher EER than other methods in the case of using short utterances. It is also shown that recognition accuracy for male speakers are higher than female independent from database and classifier.
机译:训练/测试数据的持续时间对说话人识别系统的性能有很大影响。在本文中,我们分析了训练和测试数据持续时间以及说话者性别对说话者识别系统性能的影响。高斯混合模型-通用背景模型(GMM-UBM),矢量量化-通用背景模型(VQ-UBM),支持向量机-广义线性判别序列内核(SVM-GLDS)和带有GMM超向量的支持向量机(GSV-SVM) )是我们用于说话人识别的分类器。在NIST 2002和NIST 2005说话人识别评估(SRE)数据库上进行的实验结果表明,当短语音用于独立于识别器的训练和测试时,识别性能会下降(例如,等错误率(EER)从10.33%降低至27.86%在NIST 2005上)和GSV-SVM系统在使用短发声的情况下产生的EER比其他方法更高。还表明,独立于数据库和分类器的男性说话者的识别准确度高于女性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号