首页> 外文会议>IEEE International Conference on Acoustics, Speech and Signal Processing >JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition
【24h】

JFA modeling with left-to-right structure and a new backend for text-dependent speaker recognition

机译:JFA建模,具有从左到右的结构以及一个新的后端,用于文本相关的说话人识别

获取原文

摘要

This paper introduces a new formulation of Joint Factor Analysis (JFA) for text-dependent speaker recognition based on left-to-right modeling with tied mixture HMMs. It accommodates many different ways of extracting multiple features to characterize speakers (features may or may not be HMM state-dependent, they may be modeled with subspace or factorial priors and these priors maybe imputed from text-dependent or text-independent background data). We feed these features to a new, trainable classifier for text-dependent speaker recognition in a manner which is broadly analogous to the i-vector/PLDA cascade in text-independent speaker recognition. We have evaluated this approach on a challenging proprietary dataset consisting of telephone recordings of short English and Urdu pass-phrases collected in Pakistan. By fusing results obtained with multiple front ends, equal error rate of around 2% are achievable.
机译:本文介绍了一种新的联合因子分析(JFA)公式,用于基于从左到右建模并带有混合HMM的文本相关的说话人识别。它提供了多种提取多个特征以表征说话人的方式(特征可能取决于HMM状态,也可能不取决于HMM状态,可以使用子空间或阶乘先验进行建模,并且这些先验可以从与文本相关或与文本无关的背景数据中推算出来)。我们将这些功能提供给一个新的,可训练的分类器,以与文本无关的说话人识别大体上类似于i-vector / PLDA级联的方式来进行与文本有关的说话人识别。我们已经在具有挑战性的专有数据集上评估了这种方法,该数据集包括在巴基斯坦收集的英语和乌尔都语短短语的电话录音。通过融合从多个前端获得的结果,可以实现大约2%的相等错误率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号