首页> 外文OA文献 >Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data
【2h】

Comparison of Generative and Discriminative Approaches for Speaker Recognition with Limited Data

机译:有限数据的说话人识别的生成方法和判别方法的比较

摘要

This paper presents a comparison of three different speaker recognition methods deployed in a broadcast news processing system. We focus on how the generative and discriminative nature of these methods affects the speaker recognition framework and we also deal with intersession variability compensation techniques in more detail, which are of great interest in broadcast processing domain. Performed experiments are specific particularly for the very limited amount of data used for both speaker enrollment (typically ranging from 30 to 60 seconds) and recognition (typically ranging from 5 to 15 seconds). Our results show that the system based on Gaussian Mixture Models (GMMs) outperforms both systems based on Support Vector Machines (SVMs) but its drawback is higher computational cost.
机译:本文对广播新闻处理系统中部署的三种不同的说话人识别方法进行了比较。我们关注这些方法的生成性和判别性如何影响说话者识别框架,并且我们还将更详细地处理会话间可变性补偿技术,这在广播处理领域非常重要。进行的实验特别针对说话人注册(通常为30至60秒)和识别(通常为5至15秒)所使用的非常有限的数据量。我们的结果表明,基于高斯混合模型(GMM)的系统优于基于支持向量机(SVM)的两个系统,但是其缺点是计算成本较高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号