DURATION MISMATCH COMPENSATION FOR I-VECTOR BASED SPEAKER RECOGNITION SYSTEMS

机译：基于I型向量的扬声器识别系统的持续时间不匹配补偿

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Speaker recognition systems trained on long duration utterances are known to perform significantly worse when short test segments are encountered. To address this mismatch, we analyze the effect of duration variability on phoneme distributions of speech utterances and i-vector length. We demonstrate that, as utterance duration is decreased, number of detected unique phonemes and i-vector length approaches zero in a logarithmic and non-linear fashion, respectively. Assuming duration variability as an additive noise in the i-vector space, we propose three different strategies for its compensation: i) multi-duration training in Probabilistic Linear Discriminant Analysis (PLDA) model, ii) score calibration using log duration as a Quality Measure Function (QMF), and iii) multi-duration PLDA training with synthesized short duration i-vectors. Experiments are designed based on the 2012 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE) protocol with varying test utterance duration. Experimental results demonstrate the effectiveness of the proposed schemes on short duration test conditions, especially with the QMF calibration approach.

机译：扬声器识别系统已知在遇到短的测试段时，已知在长期持续时间发声中培训的系统显着更差。为了解决这种不匹配，我们分析了持续时间可变性对语音发声和I形向量长度的音素分布的影响。我们证明，随着话语持续时间减少，检测到的独特音素和I形载体长度分别以对数和非线性方式接近零。假设持续时间可变性作为I - 矢量空间中的添加性噪声，我们提出了三种不同的补偿策略：i）概率线性判别分析（PLDA）模型中的多持续时间培训，ii）使用日志持续时间作为质量测量的评分校准功能（QMF）和III）多持续时间PLDA培训，具有合成短持续时间I-向量。实验是根据2012年国家标准和技术研究所（NIST）扬声器识别评估（SRE）协议的设计，具有不同的测试话语持续时间。实验结果表明了提出的方案在短时间内测试条件下的有效性，特别是QMF校准方法。

著录项

来源
《IEEE International Conference on Acoustics, Speech, and Signal Processing》|2013年||共5页
会议地点
作者
Taufiq Hasan; Rahim Saeidi; John H. L. Hansen; David A. van Leeuwen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TN912-53;
关键词

相似文献

外文文献
中文文献
专利

1. Duration compensation of i-vectors for short duration speaker verification [J] . Jianbo Ma, Vidhyasaharan Sethu, Eliathamby Ambikairajah, Electronics Letters . 2017,第6期

机译：i向量的持续时间补偿，用于短时说话者验证
2. I-vector based speaker recognition using advanced channel compensation techniques [J] . Ahilan Kanagasundaram, David Dean, Sridha Sridharan, Computer speech and language . 2014,第1期

机译：使用高级通道补偿技术的基于I矢量的说话人识别
3. Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors [J] . Maghsoodi Nooshin, Sameti Hossein, Zeinal Hossein, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2019,第11期

机译：基于不确定性归一化HMM的i向量的带有随机数字字符串的说话人识别
4. Duration mismatch compensation for i-vector based speaker recognition systems [C] . Hasan Taufiq, Saeidi Rahim, Hansen John H.L., IEEE International Conference on Acoustics, Speech and Signal Processing . 2013

机译：基于i向量的说话人识别系统的持续时间不匹配补偿
5. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D] . Millington, Daniel S. 2011

机译：基于说话者特征的说话人识别系统声学模型自适应方法
6. Recognition and repair of compound DNA lesions (base damage and mismatch) by human mismatch repair and excision repair systems. [O] . D Mu, M Tursun, D R Duckett, 1997

机译：通过人类错配修复和切除修复系统识别和修复复合DNA损伤（碱基损伤和错配）。
7. Duration mismatch compensation for i-vector based speaker recognition systems [O] . Hasan T., Saeidi R., Hanson J.H.L., 2013

机译：基于i向量的说话人识别系统的持续时间不匹配补偿
8. DOMAIN MISMATCH COMPENSATION FOR SPEAKER RECOGNITION USING A LIBRARY OF WHITENERS. [R] . Singer, E., Reynolds, D. A. 2015

机译：利用白人图书馆进行演讲者识别的域名失调补偿。

DURATION MISMATCH COMPENSATION FOR I-VECTOR BASED SPEAKER RECOGNITION SYSTEMS

摘要

著录项

相似文献

相关主题

期刊订阅