Knowing the non-target speakers: The effect of the i-vector population for PLDA training in speaker recognition

机译：了解非目标说话人：i-vector种群对说话人识别中PLDA训练的影响

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Inspired by the NIST SRE-2012 evaluation conditions we train the PLDA classifier in an i-vector speaker recognition system with different speaker populations, either including or excluding the target speakers in the evaluation. Including the target speakers in the PLDA training is always beneficial w.r.t. completely excluding them—which is the normal situation in pre-2012 SRE protocols—even in the Pknown = 0 evaluation condition. However, adding other speakers than just the targets speakers can slightly increase performance. We also investigated the effect of adding i-vectors extracted from segments with added noise in the PLDA training. This generally makes the system more robust to noise in the test segments, and doesn't hurt performance in the clean condition. The paper further details the 'simple to compound' log-likelihood-ratio conversion necessary for SRE-2012 style calibration.

机译：受NIST SRE-2012评估条件的启发，我们在i-vector说话者识别系统中训练了PLDA分类器，该系统具有不同的说话者群体，包括或不包括评估中的目标说话者。在PLDA培训中包括目标演讲者总是有益的。即使在P = 0评估条件下，也完全排除了它们（这是2012年以前的SRE协议中的正常情况）。但是，添加除目标扬声器之外的其他扬声器可以稍微提高性能。我们还研究了在PLDA训练中添加从段中提取的i-vector以及添加的噪声的效果。通常，这会使系统对测试段中的噪声更加健壮，并且在清洁条件下不会损害性能。本文进一步详细介绍了SRE-2012样式校准所需的“简单到复合”对数似然比转换。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2013年|6778-6782|共5页
会议地点
作者
van Leeuwen David A.; Saeidi Rahim;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
PLDA; Speaker recognition; calibration; i-vector; noise robustness;

机译：PLDA说话人识别校准i矢量噪声鲁棒性;
入库时间 2022-08-26 15:10:28

相似文献

外文文献
中文文献
专利

1. Nonparametrically trained PLDA for short duration i-vector speaker verification [J] . Abbas Khosravani, Mohammad M. Homayounpour Computer speech and language . 2018,第NOVa期

机译：非参数训练的PLDA，用于短时i向量说话者验证
2. Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification [J] . Osman Büyük Signal Processing, IET . 2016,第8期

机译：基于Sentence-HMM状态的i-vector / PLDA建模可提高与文本相关的单个说话者说话人验证的性能
3. From single to multiple enrollment i-vectors: Practical PLDA scoring variants for speaker verification [J] . Padmanabhan Rajan, Anton Afanasyev, Ville Hautam?ki, Digital Signal Processing . 2014,第Null期

机译：从单个注册到多个注册i向量：用于说话人验证的实用PLDA评分变体
4. KNOWING THE NON-TARGET SPEAKERS: THE EFFECT OF THE I-VECTOR POPULATION FOR PLDA TRAINING IN SPEAKER RECOGNITION [C] . David A. van Leeuwen, Rahim Saeidi IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：了解非目标扬声器：I - 矢量人口对扬声器认可的PLDA培训的影响
5. Discriminative training for speaker adaptation and minimum Bayes risk estimation in large vocabulary speech recognition. [D] . Doumpiotis, Vlasios. 2005

机译：大词汇量语音识别中的说话人适应性和最低贝叶斯风险估计的判别训练。
6. Revisiting vocal perception in non-human animals: a review of vowel discrimination speaker voice recognition and speaker normalization [O] . Buddhamas Kriengwatana, Paola Escudero, Carel ten Cate 2014

机译：重温非人类动物的声音感知：元音辨别说话人语音识别和说话人正常化的综述
7. End-to-end DNN Based Speaker Recognition Inspired by i-vector and PLDA [O] . Rohdin, Johan, Silnova, Anna, Diez, Mireia, 2018

机译：基于端到端DNN的说话人识别灵感来自i-vector和pLDa
8. Channel Compensation for Speaker Recognition using MAP Adapted PLDA and Denoising DNNs. [R] . Richardson, F. S., Reynolds, D. A., Nemsick, B. 2016

机译：使用map自适应pLDa和去噪DNN进行说话人识别的信道补偿。

Knowing the non-target speakers: The effect of the i-vector population for PLDA training in speaker recognition

摘要

著录项

相似文献

相关主题

期刊订阅