首页> 外文会议>2012 International Conference on Signal Processing and Communications >Sparse representation of total variability smoothed GMM mean supervectors for speaker verification
【24h】

Sparse representation of total variability smoothed GMM mean supervectors for speaker verification

机译:用于说话人验证的总变异平滑GMM平均超向量的稀疏表示

获取原文
获取原文并翻译 | 示例

摘要

The total variability i-vector based speaker verification system is one of the most successful systems in the recent NIST evaluations. It achieves significant improvement in performance over the conventional GMM-UBM based systems by using the projections of the GMM mean shifted supervectors to a low dimensional space for representation. This low dimensional projections are commonly referred to as the total variability i-vector features. In our recent works we have explored the use of sparse representation of the GMM mean shifted supervectors derived using a learned redundant dictionary as a feature for the speaker verification. This approach resulted in a performance comparable to that of the similar complexity i-vector based system. In this work, we explore a fusion of these two approaches in which the GMM mean supervectors are smoothed using the total variability space prior to creating dictionary for sparse representation. The proposed method is found to give a relative improvement of 19% in EER compared to that of the i-vector based system for the experiments done using the NIST 2003 SRE database.
机译:基于总可变性向量的说话人验证系统是最近NIST评估中最成功的系统之一。通过使用GMM均值偏移超向量的投影到低维空间进行表示,与常规的基于GMM-UBM的系统相比,它在性能上实现了重大改进。这种低维投影通常称为总可变性i向量特征。在我们最近的工作中,我们探索了使用GMM均值移位超向量的稀疏表示形式,该稀疏表示形式是使用学习到的冗余字典作为说话人验证的功能而得出的。这种方法的性能可与基于复杂度的基于i-vector的系统相媲美。在这项工作中,我们探索了这两种方法的融合,其中在创建稀疏表示字典之前,使用总可变空间对GMM均值超向量进行平滑处理。对于使用NIST 2003 SRE数据库进行的实验,发现与基于i-vector的系统相比,该方法在EER方面具有19%的相对改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号