首页> 外文OA文献 >Dialect Recognition Using a Phone-GMM-Supervector-Based SVM Kernel: Presentation Slides
【2h】

Dialect Recognition Using a Phone-GMM-Supervector-Based SVM Kernel: Presentation Slides

机译:使用基于Phone-GMM-Supervector的SVM内核的方言识别:演示幻灯片

摘要

In this paper, we introduce a new approach to dialect recognition which relies on the hypothesis that certain phones are realized differently across dialects. Given a speaker’s utterance, we first obtain the most likely phone sequence using a phone recognizer. We then extract GMM Supervectors for each phone instance. Using these vectors, we design a kernel function that computes the similarities of phones between pairs of utterances. We employ this kernel to train SVM classifiers that estimate posterior probabilities, used during recognition. Testing our approach on four Arabic dialects from 30s cuts, we compare our performance to five approaches: PRLM; GMM-UBM; our own improved version of GMM-UBM which employs fMLLR adaptation; our recent discriminative phonotactic approach; and a state-of-the-art system: SDC-based GMM-UBM discriminatively trained. Our kernel-based technique outperforms all these previous approaches; the overall EER of our system is 4.9%.
机译:在本文中,我们介绍了一种新的方言识别方法,该方法基于以下假设:某些电话在方言中的实现方式有所不同。考虑到说话者的话语,我们首先使用电话识别器获得最可能的电话序列。然后,我们为每个电话实例提取GMM超向量。使用这些向量,我们设计了一个内核函数,用于计算发声对之间的电话相似度。我们使用此内核来训练SVM分类器,该分类器估计在识别过程中使用的后验概率。测试从30年代开始对四种阿拉伯方言进行的处理,我们将性能与五种方法进行比较:PRLM; GMM-UBM;我们自己的改进版GMM-UBM,采用了fMLLR自适应技术;我们最近的辨别音韵方法;以及最先进的系统:经过严格训练的基于SDC的GMM-UBM。我们基于内核的技术优于所有以前的方法;我们系统的整体EER为4.9%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号