首页> 外文会议>International Conference on speech and computer >A Comparison of Covariance Matrix and i-vector Based Speaker Recognition
【24h】

A Comparison of Covariance Matrix and i-vector Based Speaker Recognition

机译:基于协方差矩阵和基于i向量的说话人识别的比较

获取原文

摘要

The paper presents results of an evaluation of covariance matrix and i-vector based speaker identification methods on Serbian S70W100s120 database. Open set speaker identification evaluation scheme was adopted. The number of target speakers and the number of impostors were 20 and 60 respectively. Additional utterances from 41 speakers were used for training. Amount of data for modeling a target speaker was limited to about 4 s of speech. In this study, the i-vector base approach showed significantly better performance (equal error rate EER ~5%) than the covariance matrix based approach (EER ~ 16%). This small EER for the i-vector based approach was obtained after substantial reduction of the number of the parameters in universal background model, i-vector transformation matrix and Gaussian probabilistic linear discriminant analysis that is typically reported in the papers. Additionally, these experiments showed that cepstral mean and variance normalization can deteriorate EER in case of a single channel.
机译:本文介绍了在塞尔维亚S70W100s120数据库上评估基于协方差矩阵和基于i-vector的说话人识别方法的结果。采用开放式说话人识别评估方案。演讲者的人数和冒名顶替者的人数分别为20和60。来自41位演讲者的其他言论被用于培训。用于建模目标说话者的数据量被限制为大约4 s的语音。在这项研究中,基于i-vector的方法显示出比基于协方差矩阵的方法(EER〜16%)更好的性能(等效错误率EER约5%)。在基于通用背景模型,i-向量变换矩阵和高斯概率线性判别分析的参数数量大大减少之后,获得了这种基于i-vector的方法的较小EER,这在论文中通常会有所报道。此外,这些实验表明,在单个通道的情况下,倒频谱均值和方差归一化会恶化EER。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号