首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings
【24h】

Text Adaptation for Speaker Verification with Speaker-Text Factorized Embeddings

机译:带扬声器文本分解嵌入式扬声器验证的文本适配

获取原文

摘要

Text mismatch between pre-collected data, either training data or enrollment data, and the actual test data can significantly hurt text-dependent speaker verification (SV) system performance. Although this problem can be solved by carefully collecting data with the target speech content, such data collection could be costly and inflexible. In this paper, we propose a novel text adaptation framework to address the text mismatch issue. Here, a speaker-text factorization network is proposed to factorize the input speech into speaker embeddings and text embeddings and then integrate them into a single representation in the later stage. Given a small amount of speaker-independent adaptation utterances, text embeddings of target speech content can be extracted and used to adapt the text-independent speaker embeddings to text-customized speaker embeddings. Experiments on RSR2015 show that text adaptation can significantly improve the performance of text mismatch conditions.
机译:预收集数据之间的文本不匹配,培训数据或注册数据,以及实际测试数据可以显着损害文本依赖扬声器验证(SV)系统性能。 虽然可以通过用目标语音内容仔细收集数据来解决这个问题,但这些数据收集可能是昂贵和不灵活的。 在本文中,我们提出了一种新颖的文本适应框架来解决文本不匹配问题。 这里,提出了一种扬声器文本分解网络,以将输入语置分解为扬声器嵌入和文本嵌入物,然后将它们集成到稍后阶段的单个表示中。 鉴于少量的扬声器无关的适应话语,可以提取目标语音内容的文本嵌入,并用于将文本无关的扬声器嵌入式调整到文本定制的扬声器嵌入。 RSR2015上的实验表明,文本适应可以显着提高文本不匹配条件的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号