End-to-End Feature Learning for Text-Independent Speaker Verification

机译：端到端特征学习，用于独立于文本的说话者验证

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Deep neural networks (DNNs) have found widespread use in text-independent speaker verification, especially the convolutional models with triplet loss. However, the training efficiency and the quality of learned features are not sufficiently good. In this paper, we present an end-to-end framework to train speaker verification models efficiently. In details, we introduce redesigned residual blocks in neural network architecture and propose a way of selecting hard triplets to improve original triplet loss function. Furthermore, the effects of hyperparameters and framing strategy in input pipeline are investigated for fine-tuning. Experimental results on the Librispeech and AISHELL-2 datasets demonstrate that the proposed method can reduce the verification equal error rate by greater than 20% relatively, which confirms the advantage of proposed methods comparing to methods in previous work.

机译：深度神经网络（DNN）已在与文本无关的说话者验证中得到广泛使用，尤其是具有三重态损失的卷积模型。但是，训练效率和学习特征的质量不够好。在本文中，我们提出了一个端到端框架来有效地训练说话者验证模型。详细地，我们在神经网络体系结构中介绍了重新设计的残差块，并提出了一种选择硬三联体以改善原始三重态损失函数的方法。此外，研究了超参数和成帧策略在输入管道中的影响，以进行微调。在Librispeech和AISHELL-2数据集上的实验结果表明，所提出的方法可以将验证均等错误率相对降低20％以上，这证实了所提出方法与以前工作相比的优势。

著录项

来源
《Chinese Control and Decision Conference》|2019年|3949-3954|共6页
会议地点
作者
Fangzhou Chen; Tengyue Bian; Li Xu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Feature extraction; Neural networks; Microsoft Windows; Pipelines; Analytical models; Computational modeling; Computer architecture;

机译：特征提取;神经网络; Microsoft Windows;管道;分析模型;计算模型;计算机体系结构;

相似文献

外文文献
中文文献
专利

1. Enhancement of a text-independent speaker verification system by using feature combination and parallel structure classifiers [J] . Abdalmalak Kerlos Atia, Gallardo-Antolin Ascension Neural computing & applications . 2018,第3期

机译：使用特征组合和并行结构分类器来增强文本独立的扬声器验证系统
2. Delta-MFCC Features and Information Theoretic Expectation Maximization based Text-independent Speaker Verification System [J] . Sheeraz Memon, Imran Ali Jokhio, Sana Hoor Arisar, IETE Journal of Research . 2012,第1期

机译：基于Delta-MFCC特征和信息理论期望最大化的基于文本的说话人验证系统
3. Ant colony optimization-based selected features for Text-independent speaker verification [J] . Hunny Pahuja, Jitender Chhabra, Ajay Khokhar International Journal of Engineering Research and Applications . 2012,第3期

机译：基于蚁群优化的选定功能，用于独立于文本的说话者验证
4. End-to-End Feature Learning for Text-Independent Speaker Verification [C] . Fangzhou Chen, Tengyue Bian, Li Xu Chinese Control and Decision Conference . 2019

机译：关于无关的扬声器验证的端到端特征学习
5. Text-Independent Speaker Identification using Statistical Learning [D] . Ojutiku, Alli Ayoola. 2015

机译：使用统计学习的与文本无关的说话人识别
6. Can We Ditch Feature Engineering? End-to-End Deep Learning for Affect Recognition from Physiological Sensor Data [O] . Maciej Dzieżyc, Martin Gjoreski, Przemysław Kazienko, 2020

机译：我们可以挖掘功能工程吗？从生理传感器数据的影响识别的端到端深度学习
7. Deep Speaker Feature Learning for Text-independent Speaker Verification [O] . Li, Lantian, Chen, Yixiang, Shi, Ying, 2017

机译：深度扬声器功能学习，用于独立于文本的扬声器验证

End-to-End Feature Learning for Text-Independent Speaker Verification

摘要

著录项

相似文献

相关主题

期刊订阅