Speaker recognition based on short utterance compensation method of generative adversarial networks

Zhangfang Hu; Yaqin Fu; Yuan Luo; Xuan Xu; Zhiguang Xia; Hongwei Zhang

首页> 外文期刊>International journal of speech technology >Speaker recognition based on short utterance compensation method of generative adversarial networks

【24h】

Speaker recognition based on short utterance compensation method of generative adversarial networks

机译：基于生成对抗网络短语补偿方法的扬声器识别

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

On the basis of gaussian mixture model-universal background model (GMM-UBM) in the speaker recognition system, the paper proposes a short utterance sample compensation method based on the generative adversarial network (GAN) to solve the problem of the inadequate corpus data caused by short utterance, which has led to a serious reduction of recognition rate. The presented method compensates the short utterance samples into the speech samples with sufficient speaker identity information by completing the antagonistic training of generator network and discriminator network. In order to avoid the model crash and gradient instability in the process of GAN training, this paper adopts the condition information in the conditional GAN to guide the compensation process of the generator network, and proposes the generator compensation performance measurement training task and the feature tag training task of the discriminator to stabilize training process. Finally, the proposed short utterance compensation method is evaluated on the speaker recognition system based on GMM-UBM. The experimental results indicate that the presented method can effectively reduce the equal error rate of the speaker recognition system in short utterance environment.

机译：在扬声器识别系统中的高斯混合模型 - 通用背景模型（GMM-UBM）的基础上，本文提出了一种基于生成的对抗网络（GAN）的短发声样本补偿方法来解决导致的语料库数据不足的问题通过短语，这导致了识别率的严重降低。本方法通过完成发电机网络和鉴别器网络的拮抗训练，将短话语样本用足够的扬声器标识信息补偿到语音样本中。为了避免模型崩溃和梯度不稳定性在GaN培训过程中，本文采用条件GaN中的条件信息来指导发电机网络的补偿过程，并提出发电机补偿性能测量训练任务和功能标签判别培训任务稳定培训过程。最后，基于GMM-UBM的扬声器识别系统评估了所提出的短语补偿方法。实验结果表明，所提出的方法可以有效地降低短语环境中扬声器识别系统的相等误差率。

著录项

来源
《International journal of speech technology》 |2020年第2期|443-450|共8页
作者
Zhangfang Hu; Yaqin Fu; Yuan Luo; Xuan Xu; Zhiguang Xia; Hongwei Zhang;
展开▼
作者单位

College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

College of Optoelectronic Engineering Chongqing University of Posts and Telecommunications Chongqing 400065 China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Gaussian mixture model-universal background model; Speaker recognition; Generative adversarial network;

机译：高斯混合模型 - 通用背景模型;发言人识别;生成对抗网络;

相似文献

外文文献
中文文献
专利

1. Forensic speaker recognition: A new method based on extracting accent and language information from short utterances [J] . Saleem Sajid, Subhan Fazli, Naseer Noman, Digital investigation . 2020,第Sepa期

机译：法医扬声器识别：一种基于简短话语中提取重音和语言信息的新方法
2. Person-independent facial expression recognition method based on improved Wasserstein generative adversarial networks in combination with identity aware [J] . Multimedia Systems . 2020,第1期

机译：基于改进的Wasserstein生成对抗网络并结合身份识别的独立于人的面部表情识别方法
3. Multi-reservoirs EEG signal feature sensing and recognition method based on generative adversarial networks [J] . Dong Yindong, Ren Fuji Computer Communications . 2020,第Deca期

机译：基于生成对策网络的多水库EEG信号特征感测和识别方法
4. Generative Adversarial Network Based i-Vector Transformation for Short Utterance Speaker Verification [C] . Jiacen Zhang, Nakamasa Inoue, Koichi Shinoda 日本音響学会;日本音響学会秋季研究発表会 . 2018

机译：基于生成对抗网络的i-Vector变换，用于说话者短话验证
5. Improved Speech Enhancement Algorithm based on Generative Adversarial Networks [D] . Wang, Kebei. 2021

机译：基于生成对抗性网络的改进语音增强算法
6. Generative Adversarial Networks-Based Semi-Supervised Automatic Modulation Recognition for Cognitive Radio Networks [O] . Mingxuan Li, Ou Li, Guangyi Liu, 2018

机译：基于生成对抗网络的认知无线电网络半监督自动调制识别
7. I-vector Transformation Using Conditional Generative Adversarial Networks for Short Utterance Speaker Verification [O] . Jiacen Zhang, Nakamasa Inoue, Koichi Shinoda 2018

机译：i - 矢量转换使用条件生成的对抗网络短语扬声器验证
8. Fusing Discriminative and Generative Methods for Speaker Recognition: Experiments on Switchboard and NFI/TNO Field Data. [R] . Campbell, W. M., Reynolds, D. A., Campbell, J. P. 2016

机译：融合说话人识别的判别和生成方法：交换机和NFI / TNO现场数据的实验。

Speaker recognition based on short utterance compensation method of generative adversarial networks

摘要

著录项

相似文献

相关主题

期刊订阅