Boosting the Performance of I-Vector Based Speaker Verification via Utterance Partitioning

Rao W.; Mak M.-W.

Abstract

The success of the recent i-vector approach to speaker verification relies on the capability of i-vectors to capture speaker characteristics and on the subsequent channel compensation methods to suppress channel variability. Typically, an i-vector is determined from an utterance regardless of its length. This paper investigates how utterance length affects the discriminative power of i-vectors and demonstrates that this discriminative power reaches a plateau quickly as the utterance length increases. This observation suggests that it is possible to make the best use of a long conversation by partitioning it into a number of sub-utterances so that more i-vectors can be produced for each conversation. To increase the number of sub-utterances without sacrificing the representation power of the corresponding i-vectors, repeated applications of frame-index randomization and utterance partitioning are performed. Results on the NIST 2010 speaker recognition evaluation (SRE) suggest that (1) using more i-vectors per conversation can help to find more robust linear discriminant analysis (LDA) and within-class covariance normalization (WCCN) transformation matrices, especially when the number of conversations per training speaker is limited; and (2) increasing the number of i-vectors per target speaker helps the i-vector based support vector machines (SVM) to find better decision boundaries, so that SVM scoring outperforms cosine distance scoring by 19% and 9% in terms of minimum normalized DCF and EER, respectively.
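
The repeated frame-index randomization and utterance partitioning described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `up_avr_partitions`, its parameters, and the choice of near-equal splits are assumptions made for illustration, and the i-vector extractor that would consume each sub-utterance is not shown.

```python
import numpy as np

def up_avr_partitions(frames, num_partitions=4, num_resamplings=2, seed=0):
    """Split a (T, D) matrix of acoustic frames into shuffled sub-utterances.

    Hypothetical helper: names and defaults are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    frames = np.asarray(frames)
    sub_utterances = []
    for _ in range(num_resamplings):
        # Frame-index randomization: draw a random ordering of the frame indices.
        order = rng.permutation(len(frames))
        # Utterance partitioning: cut the shuffled indices into near-equal parts.
        for part in np.array_split(order, num_partitions):
            sub_utterances.append(frames[part])
    return sub_utterances

# Toy utterance with 10 frames of 2-dimensional features.
frames = np.arange(20).reshape(10, 2)
subs = up_avr_partitions(frames, num_partitions=2, num_resamplings=3)
print(len(subs), subs[0].shape)
```

Each returned sub-utterance would then be passed to an i-vector extractor, so one conversation yields `num_partitions * num_resamplings` i-vectors in this sketch instead of one.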

Bibliographic details

Source: Audio, Speech, and Language Processing, IEEE Transactions on, 2013, No. 5, pp. 1012-1022 (11 pages)
Authors: Rao W.; Mak M.-W.
Affiliation: Department of Electronic and Information Engineering, The Hong Kong Polytechnic University, Hong Kong
Original format: PDF
Language: English
Keywords: Acoustics; Interviews; NIST; Speech; Support vector machines; Training; Vectors; I-vectors; linear discriminant analysis; speaker verification; support vector machines; utterance partitioning with acoustic vector resampling (UP-AVR)

Similar literature

1. Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification [J]. Osman Büyük. Signal Processing, IET. 2016, No. 8.
2. Improved i-vector extraction technique for speaker verification with short utterances [J]. Arnab Poddar, Md Sahidullah, Goutam Saha. International Journal of Speech Technology. 2018, No. 3.
3. Non-speaker information reduction from Cosine Similarity Scoring in i-vector based speaker verification [J]. Zeinali Hossein, Mirian Alireza, Sameti Hossein. Computers and Electrical Engineering. 2015.
4. GMM and i-vector based speaker verification using speaker-specific-text for short utterances [C]. Bharathi B., Nagarajan T. IEEE Region 10 Conference. 2013.
5. Speaker adaptation in joint factor analysis based text independent speaker verification [D]. Shou-Chun, Yin. 2007.
6. Short-time speaker verification with different speaking style utterances [O]. Hongwei Mao, Yan Shi, Yue Liu. 2020.
7. Boosting the performance of I-vector based speaker verification via utterance partitioning [O]. Rao, W, Mak, MW. 2013.
