首页> 外文期刊>International journal of speech technology >Speaker recognition based on short utterance compensation method of generative adversarial networks
【24h】

Speaker recognition based on short utterance compensation method of generative adversarial networks

机译:基于生成对抗网络短语补偿方法的扬声器识别

获取原文
获取原文并翻译 | 示例
           

摘要

On the basis of gaussian mixture model-universal background model (GMM-UBM) in the speaker recognition system, the paper proposes a short utterance sample compensation method based on the generative adversarial network (GAN) to solve the problem of the inadequate corpus data caused by short utterance, which has led to a serious reduction of recognition rate. The presented method compensates the short utterance samples into the speech samples with sufficient speaker identity information by completing the antagonistic training of generator network and discriminator network. In order to avoid the model crash and gradient instability in the process of GAN training, this paper adopts the condition information in the conditional GAN to guide the compensation process of the generator network, and proposes the generator compensation performance measurement training task and the feature tag training task of the discriminator to stabilize training process. Finally, the proposed short utterance compensation method is evaluated on the speaker recognition system based on GMM-UBM. The experimental results indicate that the presented method can effectively reduce the equal error rate of the speaker recognition system in short utterance environment.
机译:在扬声器识别系统中的高斯混合模型 - 通用背景模型(GMM-UBM)的基础上,本文提出了一种基于生成的对抗网络(GAN)的短发声样本补偿方法来解决导致的语料库数据不足的问题通过短语,这导致了识别率的严重降低。本方法通过完成发电机网络和鉴别器网络的拮抗训练,将短话语样本用足够的扬声器标识信息补偿到语音样本中。为了避免模型崩溃和梯度不稳定性在GaN培训过程中,本文采用条件GaN中的条件信息来指导发电机网络的补偿过程,并提出发电机补偿性能测量训练任务和功能标签判别培训任务稳定培训过程。最后,基于GMM-UBM的扬声器识别系统评估了所提出的短语补偿方法。实验结果表明,所提出的方法可以有效地降低短语环境中扬声器识别系统的相等误差率。

著录项

  • 来源
    《International journal of speech technology》 |2020年第2期|443-450|共8页
  • 作者单位

    College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

    College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

    College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

    College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

    College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

    College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Gaussian mixture model-universal background model; Speaker recognition; Generative adversarial network;

    机译:高斯混合模型 - 通用背景模型;发言人识别;生成对抗网络;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号