首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification
【24h】

Group nonnegative matrix factorisation with speaker and session variability compensation for speaker identification

机译:具有说话人和会话可变性补偿的群组非负矩阵分解,用于说话人识别

获取原文

摘要

This paper presents a feature learning approach for speaker identification that is based on nonnegative matrix factorisation. Recent studies have shown that with such models, the dictionary atoms can represent well the speaker identity. The approaches proposed so far focused only on speaker variability and not on session variability. However, this later point is a crucial aspect in the success of the I-vector approach that is now the state-of-the-art in speaker identification. This paper proposes a method that relies on group nonnegative matrix factorisation and that is inspired by the I-vector training procedure. By doing so the proposed approach intends to capture both the speaker variability and the session variability. Results on a small corpus prove that the proposed approach can be competitive with I-vectors.
机译:本文提出了一种基于非负矩阵分解的说话人识别特征学习方法。最近的研究表明,使用这种模型,词典原子可以很好地代表说话者的身份。到目前为止,所提出的方法仅关注说话者的可变性,而不关注会话的可变性。然而,这一点是I矢量方法成功的关键方面,而I矢量方法现在已成为说话人识别的最新技术。本文提出了一种基于组非负矩阵分解的方法,该方法受I矢量训练过程的启发。通过这样做,所提出的方法旨在捕获说话者的可变性和会话的可变性。在一个小语料库上的结果证明,所提出的方法可以与I-vector竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号