首页> 外文会议>Chinese Automation Congress >Speaker Verification Using IMNMF and MFCC with Feature Warping Under Noisy Environment
【24h】

Speaker Verification Using IMNMF and MFCC with Feature Warping Under Noisy Environment

机译:在噪声环境下使用IMNMF和MFCC进行特征变形的扬声器验证

获取原文

摘要

The mismatch between the training conditions and the test conditions severely degrades the performance of speaker verification. Aiming at solving this problem, this paper presents a method which uses a combination of improved nonnegative matrix factorization (IMNMF) and mel frequency cepstral coefficients (MFCC) with feature warping for improving identity-vector (i-vector) speaker verification performance in noisy environment. Unlike the traditional nonnegative matrix factorization (NMF), IMNMF uses extra free basis vectors to capture the features which are not included in training data, and linear constraints on dictionary atoms. Feature warping is used to remove channel noises. Therefore, the proposed method can reduce distortion of reconstructed speech while enhancing the recovered speech quality. The performance of i-vector speaker verification is evaluated by using the short utterance database and the NOISEX-92 database. The experiment results indicate that the score level fusion of feature-warped IMNMF-MFCC and feature-warped MFCC is superior to the baseline system at the equal error rate under the majority of signal-to-noise ratios (SNRs).
机译:训练条件和测试条件之间的不匹配严重降低了说话者验证的性能。为了解决这个问题,本文提出了一种结合使用改进的非负矩阵分解(IMNMF)和梅尔频率倒谱系数(MFCC)以及特征扭曲的方法,以改善嘈杂环境中的身份向量(i-vector)说话者验证性能。与传统的非负矩阵分解(NMF)不同,IMMNF使用额外的自由基向量来捕获训练数据中未包含的特征以及对字典原子的线性约束。特征扭曲用于消除通道噪声。因此,提出的方法可以在提高恢复的语音质量的同时减少重构语音的失真。使用短话语数据库和NOISEX-92数据库评估i-vector说话者验证的性能。实验结果表明,在大多数信噪比(SNR)下,在相同的误码率下,特征扭曲的IMNMF-MFCC和特征扭曲的MFCC的得分水平融合优于基线系统。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号