首页> 外文会议>Asia-Pacific Signal and Information Processing Association Annual Summit and Conference >KL-Divergence based Mispronunciation Detection via DNN and Decision Tree in the Phonetic Space
【24h】

KL-Divergence based Mispronunciation Detection via DNN and Decision Tree in the Phonetic Space

机译:基于KL分歧的致电空间中的DNN和决策树的误用检测

获取原文

摘要

We propose to detect mispronunciations in a language learners speech via a discriminatively trained DNN in the phonetic space. The posterior probabilities of "senones" populated in a decision tree are trained and predicted speaker independently. Acoustic features of each input segment (with preceding and succeeding contexts of several frames) are mapped unto the whole set of senones in their corresponding posteriors. Vectors of senone posteriors are used as stochastic characterization of input speech segments in the phonetic space. Distortion between any two such vectors are measured with the symmetric Kullback-Leibler Divergence (KLD) and they are used for performing vector clustering and computing the corresponding centroids in a phonetically oriented senone based decision tree. Experimental results, tested on a large, Mandarin database (iCALL) of L2 language learners, show that the proposed approach to mispronunciation detection can achieve a 3.0% of equal precision and recall improvement over our best DNN-based, Goodness of Pronunciation (GOP) baseline system with adaptation. When the original maximum-likelihood trained decision tree is retrained with the symmetric KLD measure, further improvement of 0.8% of equal precision and recall can be obtained.
机译:我们建议通过语音空间中的差异训练的DNN检测语言学习者语音中的误片。在决策树中填充的“Senones”的后验概率是独立培训和预测扬声器的。每个输入段的声学特征(具有几个帧的前面和后续上下文)都将在其相应的后索中映射整组Senones。 enonone后后海后的载体被用作语音空间中输入语音段的随机表征。使用两个这样的矢量之间的失真与对称klullback-leibler发散(KLD)测量,并且它们用于执行向量聚类并计算在语音面向的基于Senone的决策树中的相应质心。实验结果,在L2语言学习者的大型普通话数据库(ICALL)上测试,表明所提出的错乱检测方法可以达到相同精度的3.0%,并回忆提高我们最好的DNN,发音优越(GOP)基线系统具有适应性。当用对称KLD测量恢复原始最大似然培训的决策树时,可以获得同等精度和召回的0.8%的进一步提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号