首页> 外文期刊>Signal Processing, IET >Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification
【24h】

Sentence-HMM state-based i-vector/PLDA modelling for improved performance in text dependent single utterance speaker verification

机译:基于Sentence-HMM状态的i-vector / PLDA建模可提高与文本相关的单个说话者说话人验证的性能

获取原文
获取原文并翻译 | 示例
       

摘要

In this paper, we make use of hidden Markov model (HMM) state alignment information in i-vector/probabilistic linear discriminant analysis (PLDA) framework to improve the verification performance in a text-dependent single utterance (TDSU) task. In the TDSU task, speakers repeat a fixed utterance in both enrollment and authentication sessions. Despite Gaussian mixture models (GMMs) have been the dominant modeling technique for text-independent applications, an HMM based method might be better suited for the TDSU task since it captures the co-articulation information better. Recently, powerful channel compensation techniques such as joint factor analysis (JFA), i-vectors and PLDA have been proposed for GMM based text-independent speaker verification. In this study, we train a separate i-vector/PLDA model for each sentence HMM state in order to utilize the alignment information of the HMM states in a TDSU task. The proposed method is tested using a multi-channel speaker verification database. In the experiments, it is observed that HMM state based i-vector/PLDA (i-vector/PLDA-HMM) provides approximately 67% relative reduction in equal error rate (EER) when compared to the i-vector/PLDA. The proposed method also outperforms the baseline GMM and sentence HMM methods. It yields approximately 51% relative reduction in EER over the best performing sentence HMM method.
机译:在本文中,我们在i-矢量/概率线性判别分析(PLDA)框架中使用了隐马尔可夫模型(HMM)状态对齐信息,以提高在文本相关的单言语(TDSU)任务中的验证性能。在TDSU任务中,演讲者会在注册和身份验证会话中重复固定的发音。尽管高斯混合模型(GMM)已成为独立于文本的应用程序的主要建模技术,但基于HMM的方法可能更好地适合TDSU任务,因为它可以更好地捕获共同发音信息。最近,针对基于GMM的独立于文本的说话者验证,已经提出了强大的信道补偿技术,例如联合因子分析(JFA),i矢量和PLDA。在这项研究中,我们为每个句子HMM状态训练一个单独的i-vector / PLDA模型,以便在TDSU任务中利用HMM状态的对齐信息。使用多通道扬声器验证数据库对提出的方法进行了测试。在实验中,观察到与i-vector / PLDA相比,基于HMM状态的i-vector / PLDA(i-vector / PLDA-HMM)提供了大约67%的均等错误率(EER)相对降低。所提出的方法也优于基线GMM和句子HMM方法。与性能最佳的句子HMM方法相比,它的EER相对降低了约51%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号