Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings

Abstract

Text mismatch between pre-collected data (either training data or enrollment data) and the actual test data can significantly hurt the performance of text-dependent speaker verification (SV) systems. Although this problem can be solved by carefully collecting data with the target speech content, such data collection can be costly and inflexible. In this paper, we propose a novel text adaptation framework to address the text mismatch issue. A speaker-text factorization network is proposed to factorize the input speech into speaker embeddings and text embeddings and then integrate them into a single representation at a later stage. Given a small amount of speaker-independent adaptation utterances, text embeddings of the target speech content can be extracted and used to adapt the text-independent speaker embeddings to text-customized speaker embeddings. Experiments on RSR2015 show that text adaptation can significantly improve performance under text-mismatch conditions.
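
The adaptation flow described in the abstract can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the abstract does not say how the two embeddings are integrated, so concatenation followed by a linear projection is used as a stand-in, the random `weight` matrix stands in for a trained integration layer, and all dimensions, names, and inputs are hypothetical.

```python
import numpy as np


def mean_text_embedding(text_embeddings: np.ndarray) -> np.ndarray:
    """Average the text embeddings of the adaptation utterances (assumed pooling)."""
    return text_embeddings.mean(axis=0)


def customize(speaker_emb: np.ndarray, text_emb: np.ndarray, weight: np.ndarray) -> np.ndarray:
    """Integrate a speaker embedding with a text embedding.

    Assumed integration: concatenate, project linearly, then L2-normalize.
    """
    joint = np.concatenate([speaker_emb, text_emb])
    projected = weight @ joint
    return projected / np.linalg.norm(projected)


def cosine_score(a: np.ndarray, b: np.ndarray) -> float:
    """Cosine similarity, a common scoring choice for verification trials."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


rng = np.random.default_rng(0)
spk_dim, txt_dim, out_dim = 8, 4, 6  # hypothetical sizes

# Random stand-in for a trained integration layer.
weight = rng.standard_normal((out_dim, spk_dim + txt_dim))

# Five speaker-independent adaptation utterances of the target phrase (random stand-ins).
adaptation_text = rng.standard_normal((5, txt_dim))
target_text = mean_text_embedding(adaptation_text)

# Text-independent speaker embeddings from enrollment and test (random stand-ins).
enroll_speaker = rng.standard_normal(spk_dim)
test_speaker = rng.standard_normal(spk_dim)

# Adapt both sides to the target text, then score the trial.
enroll_custom = customize(enroll_speaker, target_text, weight)
test_custom = customize(test_speaker, target_text, weight)
score = cosine_score(enroll_custom, test_custom)
```

The point of the sketch is only the data flow: the text embedding comes from utterances that need not belong to the enrolled speaker, so no new data has to be collected from that speaker for the target phrase.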

Bibliographic details

Source
IEEE International Conference on Acoustics, Speech and Signal Processing | 2020 | pp. 6204-6823 | 5 pages
Conference location
Authors
Yexin Yang; Shuai Wang; Xun Gong; Yanmin Qian; Kai Yu
Author affiliations
Conference organizer
Original format: PDF
Language of text
Chinese Library Classification: TN912-53
Keywords
speaker verification; text-dependent; text mismatch; adaptation

Similar literature

1. Speaker-Phrase-Specific Adaptation of PLDA Model for Improved Performance in Text-Dependent Speaker Verification [J]. Laskar Mohammad Azharuddin, Bhanja Chuya China, Laskar Rabul Hussain. Circuits, Systems and Signal Processing. 2021, Issue 10
2. Variational DNN embeddings for text-independent speaker verification [J]. Pinheiro Hector N. B., Ren Tsang Ing, Adami Andre G. Pattern Recognition Letters. 2021, Issue Auga
3. Discriminative Neural Embedding Learning for Short-Duration Text-Independent Speaker Verification [J]. Wang Shuai, Huang Zili, Qian Yanmin. IEEE/ACM Transactions on Audio, Speech, and Language Processing. 2019, Issue 11
4. Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings [C]. Yexin Yang, Shuai Wang, Xun Gong. IEEE International Conference on Acoustics, Speech and Signal Processing. 2020
5. Speaker adaptation in joint factor analysis based text independent speaker verification [D]. Shou-Chun Yin. 2007
6. Bidirectional Attention for Text-Dependent Speaker Verification [O]. Xin Fang, Tian Gao, Liang Zou. 2020
7. Cross-lingual Text-independent Speaker Verification Using Unsupervised Adversarial Discriminative Domain Adaptation [O]. Wei Xia, Jing Huang, John H.L. Hansen. 2019
