Duration mismatch compensation for i-vector based speaker recognition systems

Abstract

Speaker recognition systems trained on long-duration utterances are known to perform significantly worse when short test segments are encountered. To address this mismatch, we analyze the effect of duration variability on the phoneme distributions of speech utterances and on i-vector length. We demonstrate that, as utterance duration decreases, the number of detected unique phonemes and the i-vector length approach zero in a logarithmic and a non-linear fashion, respectively. Treating duration variability as additive noise in the i-vector space, we propose three strategies for its compensation: i) multi-duration training of the Probabilistic Linear Discriminant Analysis (PLDA) model, ii) score calibration using log duration as a Quality Measure Function (QMF), and iii) multi-duration PLDA training with synthesized short-duration i-vectors. Experiments are designed around the 2012 National Institute of Standards and Technology (NIST) Speaker Recognition Evaluation (SRE) protocol with varying test utterance duration. Experimental results demonstrate the effectiveness of the proposed schemes on short-duration test conditions, especially with the QMF calibration approach.
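Strategy ii) in the abstract can be illustrated with a small sketch. The abstract only states that log duration is used as a QMF during score calibration; the linear form `w0 + w1 * score + w2 * log(duration)`, the logistic-regression fit by gradient descent, and the names `fit_qmf_calibration` and `apply_qmf_calibration` are assumptions made for illustration and are not taken from the paper. The toy scores and durations are invented.

```python
import numpy as np


def _design_matrix(scores, durations):
    """Stack [1, raw score, log(duration)] columns for each trial."""
    scores = np.asarray(scores, dtype=float)
    durations = np.asarray(durations, dtype=float)
    return np.column_stack([np.ones_like(scores), scores, np.log(durations)])


def fit_qmf_calibration(scores, durations, labels, lr=0.1, n_iter=2000):
    """Fit weights (w0, w1, w2) by logistic regression with plain gradient descent.

    Assumed calibration form: w0 + w1 * score + w2 * log(duration).
    labels: 1 for target trials, 0 for non-target trials.
    """
    X = _design_matrix(scores, durations)
    y = np.asarray(labels, dtype=float)
    w = np.zeros(X.shape[1])
    for _ in range(n_iter):
        logits = np.clip(X @ w, -30.0, 30.0)  # clip to avoid overflow in exp
        p = 1.0 / (1.0 + np.exp(-logits))
        grad = X.T @ (p - y) / len(y)  # gradient of the mean cross-entropy
        w -= lr * grad
    return w


def apply_qmf_calibration(w, scores, durations):
    """Return calibrated scores w0 + w1 * score + w2 * log(duration)."""
    return _design_matrix(scores, durations) @ w


if __name__ == "__main__":
    # Invented toy trials: raw scores, test durations in seconds, target labels.
    raw = [-2.0, -1.0, 1.0, 2.0]
    dur = [5.0, 10.0, 20.0, 40.0]
    lab = [0, 0, 1, 1]
    weights = fit_qmf_calibration(raw, dur, lab)
    print(apply_qmf_calibration(weights, raw, dur))
```

In this sketch the duration term only shifts the score as a function of test-segment length, so the raw recognizer score is left untouched and the compensation happens entirely at the calibration stage.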

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing, 2013, pp. 7663-7667 (5 pages)
Authors
Hasan, Taufiq; Saeidi, Rahim; Hansen, John H.L.; van Leeuwen, David A.
Original format
PDF
Keywords
Speaker verification; i-vector; quality measure fusion (QMF); short utterance

Similar literature

1. Duration compensation of i-vectors for short duration speaker verification [J]. Jianbo Ma, Vidhyasaharan Sethu, Eliathamby Ambikairajah. Electronics Letters, 2017, no. 6
2. I-vector based speaker recognition using advanced channel compensation techniques [J]. Ahilan Kanagasundaram, David Dean, Sridha Sridharan. Computer Speech and Language, 2014, no. 1
3. Speaker Recognition With Random Digit Strings Using Uncertainty Normalized HMM-Based i-Vectors [J]. Maghsoodi Nooshin, Sameti Hossein, Zeinal Hossein. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2019, no. 11
4. Duration Mismatch Compensation for i-Vector Based Speaker Recognition Systems [C]. Taufiq Hasan, Rahim Saeidi, John H.-L. Hansen. IEEE International Conference on Acoustics, Speech and Signal Processing, 2013
5. Speaker Characteristic-based Acoustic Model Adaptation Method for Speaker Recognition Systems [D]. Millington, Daniel S. 2011
6. Recognition and repair of compound DNA lesions (base damage and mismatch) by human mismatch repair and excision repair systems [O]. D Mu, M Tursun, D R Duckett. 1997
7. Duration mismatch compensation for i-vector based speaker recognition systems [O]. Hasan T., Saeidi R., Hansen J.H.L. 2013
8. Domain Mismatch Compensation for Speaker Recognition Using a Library of Whiteners [R]. Singer, E., Reynolds, D. A. 2015
