首页> 外文会议>IEEE International Conference on Multimedia and Expo >Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech
【24h】

Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech

机译:将发音特征整合到声学语音模型中以英语二语语音的误音检测和诊断

获取原文

摘要

This paper proposes novel approaches to mispronunciation detection and diagnosis (MDD) on second-language (L2) learners' speech with articulatory features. Here, articulatory features are the positions of articulators when pronouncing phonemes and reflect the pronunciation mechanisms of each phoneme. The use of articulatory features in MDD is helpful in distinguishing phonemes. Three models with articulatory features are proposed based on acoustic-phonemic model (APM): 1) articulatory-acoustic-phonemic model (AAPM) that embeds articulatory features directly into input features; 2) AAPM with feature representation (R-AAPM) to represent original input features with articulatory features; and 3) articulatory multi-task acoustic-phonemic model (A-MT-APM) where phoneme recognizer and articulatory feature classifiers are trained simultaneously in multi-task manner. Compared with baseline phoneme-based APM, proposed approaches perform better in mispronunciation detection and diagnosis measured with Precision, Recall and F1-Measure metrics. Specifically, the A-MT-APM approach gains 5.6% and 7.0% improvement in F1-Measure and diagnostic accuracy respectively. The contributions include: 1) introducing the articulatory features to MDD in deep learning framework; 2) investigating several model architectures for better exploiting articulatory features.
机译:本文提出了一种新的方法来对具有发音特征的第二语言(L2)学习者的语音进行错误发音检测和诊断(MDD)。这里,发音特征是发音音素时发音器的位置,并反映每个音素的发音机制。在MDD中使用发音特征有助于区分音素。基于声学语音模型(APM),提出了三种具有发音特征的模型:1)发音声学模型(AAPM),它将发音特征直接嵌入到输入特征中; 2)具有特征表示的AAPM(R-AAPM)代表具有发音特征的原始输入特征; 3)发音多任务声学音素模型(A-MT-APM),其中以多任务方式同时训练音素识别器和发音特征分类器。与基于基线音素的APM相比,提出的方法在用Precision,Recall和F1-Measure指标测量的错误发音检测和诊断中表现更好。具体来说,A-MT-APM方法在F1-Measure和诊断准确性方面分别提高了5.6%和7.0%。所做的贡献包括:1)在深度学习框架中为MDD引入发音特征; 2)研究几种模型架构,以更好地利用发音特征。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号