首页> 外文会议>Chinese Automation Congress >Speaker verification using IMNMF and MFCC with feature warping under noisy environment
【24h】

Speaker verification using IMNMF and MFCC with feature warping under noisy environment

机译:使用IMNMF和MFCC的扬声器验证,具有嘈杂环境下的功能翘曲

获取原文

摘要

The mismatch between the training conditions and the test conditions severely degrades the performance of speaker verification. Aiming at solving this problem, this paper presents a method which uses a combination of improved nonnegative matrix factorization (IMNMF) and mel frequency cepstral coefficients (MFCC) with feature warping for improving identity-vector (i-vector) speaker verification performance in noisy environment. Unlike the traditional nonnegative matrix factorization (NMF), IMNMF uses extra free basis vectors to capture the features which are not included in training data, and linear constraints on dictionary atoms. Feature warping is used to remove channel noises. Therefore, the proposed method can reduce distortion of reconstructed speech while enhancing the recovered speech quality. The performance of i-vector speaker verification is evaluated by using the short utterance database and the NOISEX-92 database. The experiment results indicate that the score level fusion of feature-warped IMNMF-MFCC and feature-warped MFCC is superior to the baseline system at the equal error rate under the majority of signal-to-noise ratios (SNRs).
机译:培训条件和测试条件之间的不匹配严重降低了扬声器验证的性能。旨在解决这个问题,本文介绍了一种方法,它使用改进的非负矩阵分解(IMNMF)和MEL频率谱系数(MFCC)的组合,其中具有用于改善嘈杂环境中的Identity-Vector(I-Vector)扬声器验证性能的特征翘曲。与传统的非负矩阵分组(NMF)不同,IMNMF使用额外的自由基向量来捕获不包括在训练数据中的特征,以及字典原子的线性约束。功能翘曲用于删除频道噪声。因此,所提出的方法可以减少重建语音的失真,同时增强恢复的语音质量。通过使用短话语数据库和COUNTX-92数据库来评估I - 矢量扬声器验证的性能。实验结果表明,特征扭曲的IMNMF-MFCC和特征翘曲MFCC的分数水平融合优于基线系统,以大多数信噪比比(SNRS)下的相等误差率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号