A Comparison of Covariance Matrix and i-vector Based Speaker Recognition

机译：基于协方差矩阵和基于i向量的说话人识别的比较

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The paper presents results of an evaluation of covariance matrix and i-vector based speaker identification methods on Serbian S70W100s120 database. Open set speaker identification evaluation scheme was adopted. The number of target speakers and the number of impostors were 20 and 60 respectively. Additional utterances from 41 speakers were used for training. Amount of data for modeling a target speaker was limited to about 4 s of speech. In this study, the i-vector base approach showed significantly better performance (equal error rate EER ~5%) than the covariance matrix based approach (EER ~ 16%). This small EER for the i-vector based approach was obtained after substantial reduction of the number of the parameters in universal background model, i-vector transformation matrix and Gaussian probabilistic linear discriminant analysis that is typically reported in the papers. Additionally, these experiments showed that cepstral mean and variance normalization can deteriorate EER in case of a single channel.

机译：本文介绍了在塞尔维亚S70W100s120数据库上评估基于协方差矩阵和基于i-vector的说话人识别方法的结果。采用开放式说话人识别评估方案。演讲者的人数和冒名顶替者的人数分别为20和60。来自41位演讲者的其他言论被用于培训。用于建模目标说话者的数据量被限制为大约4 s的语音。在这项研究中，基于i-vector的方法显示出比基于协方差矩阵的方法（EER〜16％）更好的性能（等效错误率EER约5％）。在基于通用背景模型，i-向量变换矩阵和高斯概率线性判别分析的参数数量大大减少之后，获得了这种基于i-vector的方法的较小EER，这在论文中通常会有所报道。此外，这些实验表明，在单个通道的情况下，倒频谱均值和方差归一化会恶化EER。

著录项

来源
《International Conference on speech and computer》|2017年|37-45|共9页
会议地点
作者
Niksa Jakovljevic; Ivan Jokic; Slobodan Josic; Vlado Delic;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Speaker identification; i-vector; G-PLDA; Covariance matrix; S70W100s120;

机译：说话人识别;向量G-PLDA;协方差矩阵S70W100s120;

相似文献

外文文献
中文文献
专利

1. Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors [J] . Maghsoodi Nooshin, Sameti Hossein, Zeinal Hossein, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第11期

机译：基于不确定性归一化HMM的i向量的带有随机数字字符串的说话人识别
2. I-vector Extraction for Speaker Recognition Based on Dimensionality Reduction [J] . Noor Salwani Ibrahim, Dzati Athiar Ramli Procedia Computer Science . 2018,第1期

机译：基于降维的I向量提取用于说话人识别
3. I-vector based speaker recognition using advanced channel compensation techniques [J] . Ahilan Kanagasundaram, David Dean, Sridha Sridharan, Computer speech and language . 2014,第1期

机译：使用高级通道补偿技术的基于I矢量的说话人识别
4. A Comparison of Covariance Matrix and i-vector Based Speaker Recognition [C] . Niksa Jakovljevic, Ivan Jokic, Slobodan Josic, International Conference on Speech and Computer . 2017

机译：协方差矩阵与基于载体扬声器识别的比较
5. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D] . Millington, Daniel S. 2011

机译：基于说话者特征的说话人识别系统声学模型自适应方法
6. A Kernel Gabor-Based Weighted Region Covariance Matrix for Face Recognition [O] . Huafeng Qin, Lan Qin, Lian Xue, 2012

机译：基于核Gabor的加权区域协方差矩阵用于人脸识别
7. Speaker line-up calibration of the i-vector based speaker recognition system for forensic application [O] . Mandasari M.I., McLaren M.L., Leeuwen D.A. van 2011

机译：用于法医应用的基于i矢量的说话人识别系统的说话人阵容校准
8. Noise Robust I-Vector Extractor Using Vector Taylor Series For Speaker Recognition. [R] . Lei, Y., Burget, L., Scheffer, N. 2013

机译：使用矢量泰勒级数进行说话人识别的噪声鲁棒I-向量提取器。

A Comparison of Covariance Matrix and i-vector Based Speaker Recognition

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅