Investigation of the effect of data duration and speaker gender on text-independent speaker recognition

Cemal Hanilci; Figen Ertas

首页> 外文期刊>Computers and Electrical Engineering >Investigation of the effect of data duration and speaker gender on text-independent speaker recognition

【24h】

Investigation of the effect of data duration and speaker gender on text-independent speaker recognition

机译：研究数据持续时间和说话人性别对与文本无关的说话人识别的影响

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Duration of training/test data has a considerable effect on the performance of a speaker recognition system. In this paper, we analyze the effect of training and test data duration and speaker gender on the performance of speaker recognition systems. Gaussian mixture models-universal background model (GMM-UBM), vector quantization-universal background model (VQ-UBM), support vector machines-generalized linear discriminant sequence kernel (SVM-GLDS) and support vector machines with GMM supervectors (GSV-SVM) are the classifiers we use for speaker recognition. Experimental results conducted on NIST 2002 and NIST 2005 speaker recognition evaluation (SRE) databases show that recognition performance breaks down when short utterances are used for training and testing independent from the recognizer (e.g. equal error rate (EER) reduces from 10.33% to 27.86% on NIST 2005) and GSV-SVM system yields higher EER than other methods in the case of using short utterances. It is also shown that recognition accuracy for male speakers are higher than female independent from database and classifier.

机译：训练/测试数据的持续时间对说话人识别系统的性能有很大影响。在本文中，我们分析了训练和测试数据持续时间以及说话者性别对说话者识别系统性能的影响。高斯混合模型-通用背景模型（GMM-UBM），矢量量化-通用背景模型（VQ-UBM），支持向量机-广义线性判别序列内核（SVM-GLDS）和带有GMM超向量的支持向量机（GSV-SVM））是我们用于说话人识别的分类器。在NIST 2002和NIST 2005说话人识别评估（SRE）数据库上进行的实验结果表明，当短语音用于独立于识别器的训练和测试时，识别性能会下降（例如，等错误率（EER）从10.33％降低至27.86％在NIST 2005上）和GSV-SVM系统在使用短发声的情况下产生的EER比其他方法更高。还表明，独立于数据库和分类器的男性说话者的识别准确度高于女性。

著录项

来源
《Computers and Electrical Engineering》 |2013年第2期|共12页
作者
Cemal Hanilci; Figen Ertas;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类计算机的应用;
关键词
入库时间 2022-08-18 09:39:46

相似文献

外文文献
中文文献
专利

1. Investigation of the effect of data duration and speaker gender on text-independent speaker recognition [J] . Cemal Hanilci, Figen Ertas Computers and Electrical Engineering . 2013,第2期

机译：研究数据持续时间和说话人性别对与文本无关的说话人识别的影响
2. Speaker-specific mapping for text-independent speaker recognition [J] . Hemant Misra, Shajith Ikbal, B. Yegnanarayana Speech Communication . 2003,第3a4期

机译：特定于说话人的映射，用于与文本无关的说话人识别
3. Data-model relationship in text-independent speaker recognition [J] . Mason JSD, Evans NWD, Stapert R, EURASIP journal on applied signal processing . 2005,第4期

机译：文本无关的说话人识别中的数据模型关系
4. ADAPTIVE 3-D DATA COMPRESSION ALGORITHM OF SPEAKER DATA FOR TEXT-INDEPENDENT SPEAKER RECOGNITION [C] . Shung-Yung Lung IASTED (the International Association of Science and Technology for Development) International Conference on Signal Processing, Pattern Recognition, and Application, Jun 25-28, 2002, Crete, Greece . 2002

机译：文本无关的说话人识别的说话人数据自适应3-D数据压缩算法
5. Text-independent Speaker Recognition Using Discriminative Subspace Analysis [D] . Jiang, Weiwu 2012

机译：区分子空间分析的文本无关说话人识别
6. Is un stylo sharper than une épée? Investigating the interaction of sound symbolism and grammatical gender in English and French speakers [O] . David M. Sidhu, Penny M. Pexman, Jean Saint-Aubin, 2019

机译：UN STYMEO SHARPER比UNEÉPÉE更尖锐吗？调查英语和法语扬声器的声音象征与语法性别的互动
7. Frame-Level Speaker Embeddings for Text-Independent Speaker Recognition and Analysis of End-to-End Model [O] . Suwon Shon, Hao Tang, James Glass 2018

机译：帧级扬声器嵌入文本独立扬声器识别和结束模型分析

Investigation of the effect of data duration and speaker gender on text-independent speaker recognition

摘要

著录项

相似文献

相关主题

期刊订阅