IEEE International Conference on Acoustics, Speech and Signal Processing

Gaussian-constrained Training for Speaker Verification

Abstract

Neural models, in particular the d-vector and x-vector architectures, have produced state-of-the-art performance on many speaker verification tasks. However, two potential problems of these neural models deserve more investigation. First, both models suffer from 'information leak': some parameters that participate in model training are discarded during inference, i.e., the layers used as the classifier. Second, these models do not regulate the distribution of the derived speaker vectors, and this 'unconstrained distribution' may degrade the performance of the subsequent scoring component, e.g., PLDA. This paper proposes a Gaussian-constrained training approach that (1) discards the parametric classifier, and (2) enforces the distribution of the derived speaker vectors to be Gaussian. Our experiments on the VoxCeleb and SITW databases demonstrated that this new training approach produced more representative and regular speaker embeddings, leading to consistent performance improvement.
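
To make the idea concrete, below is a minimal PyTorch-style sketch of how such an objective might look: a non-parametric classification term that scores embeddings against per-speaker class means (so no discardable classifier layers are trained), plus a penalty acting as a standard-Gaussian log-prior on the embeddings. The function name, the class-mean scoring, and the gauss_weight trade-off are illustrative assumptions for this sketch, not the exact loss used in the paper.

import torch
import torch.nn.functional as F

def gaussian_constrained_loss(embeddings, labels, class_means, gauss_weight=0.1):
    """Illustrative loss. embeddings: (batch, dim) speaker vectors from the encoder;
    labels: (batch,) integer speaker ids; class_means: (num_speakers, dim) per-speaker
    means maintained outside the network (e.g. as running averages), so no parametric
    classifier is trained and later discarded."""
    # Non-parametric classification: score each embedding by its negative Euclidean
    # distance to every speaker mean; a closer mean yields a higher logit.
    logits = -torch.cdist(embeddings, class_means)
    cls_loss = F.cross_entropy(logits, labels)

    # Gaussian constraint: negative log-density of a standard normal prior (up to a
    # constant), pulling the embedding distribution toward N(0, I).
    gauss_loss = 0.5 * embeddings.pow(2).sum(dim=1).mean()

    return cls_loss + gauss_weight * gauss_loss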
