Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech

机译：将发音特征整合到声学语音模型中以英语二语语音的误音检测和诊断

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper proposes novel approaches to mispronunciation detection and diagnosis (MDD) on second-language (L2) learners' speech with articulatory features. Here, articulatory features are the positions of articulators when pronouncing phonemes and reflect the pronunciation mechanisms of each phoneme. The use of articulatory features in MDD is helpful in distinguishing phonemes. Three models with articulatory features are proposed based on acoustic-phonemic model (APM): 1) articulatory-acoustic-phonemic model (AAPM) that embeds articulatory features directly into input features; 2) AAPM with feature representation (R-AAPM) to represent original input features with articulatory features; and 3) articulatory multi-task acoustic-phonemic model (A-MT-APM) where phoneme recognizer and articulatory feature classifiers are trained simultaneously in multi-task manner. Compared with baseline phoneme-based APM, proposed approaches perform better in mispronunciation detection and diagnosis measured with Precision, Recall and F1-Measure metrics. Specifically, the A-MT-APM approach gains 5.6% and 7.0% improvement in F1-Measure and diagnostic accuracy respectively. The contributions include: 1) introducing the articulatory features to MDD in deep learning framework; 2) investigating several model architectures for better exploiting articulatory features.

机译：本文提出了一种新的方法来对具有发音特征的第二语言（L2）学习者的语音进行错误发音检测和诊断（MDD）。这里，发音特征是发音音素时发音器的位置，并反映每个音素的发音机制。在MDD中使用发音特征有助于区分音素。基于声学语音模型（APM），提出了三种具有发音特征的模型：1）发音声学模型（AAPM），它将发音特征直接嵌入到输入特征中; 2）具有特征表示的AAPM（R-AAPM）代表具有发音特征的原始输入特征; 3）发音多任务声学音素模型（A-MT-APM），其中以多任务方式同时训练音素识别器和发音特征分类器。与基于基线音素的APM相比，提出的方法在用Precision，Recall和F1-Measure指标测量的错误发音检测和诊断中表现更好。具体来说，A-MT-APM方法在F1-Measure和诊断准确性方面分别提高了5.6％和7.0％。所做的贡献包括：1）在深度学习框架中为MDD引入发音特征; 2）研究几种模型架构，以更好地利用发音特征。

著录项

来源
《IEEE International Conference on Multimedia and Expo》|2018年|1-6|共6页
会议地点
作者
Shaoguang Mao; Zhiyong Wu; Xu Li; Runnan Li; Xixin Wu; Helen Meng;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Hidden Markov models; Acoustics; Phonetics; Task analysis; Computational modeling; Training;

机译：隐马尔可夫模型;声学;语音;任务分析;计算模型;训练;

相似文献

外文文献
中文文献
专利

1. Efficient personalized mispronunciation detection of Taiwanese-accented English speech based on unsupervised model adaptation and dynamic sentence selection [J] . Chung-Hsien Wu, Hung-Yu Su, Chao-Hong Liu Computer assisted language learning . 2013,第5期

机译：基于无监督模型自适应和动态句子选择的有效的个性化英语口音发音
2. Integrating Articulatory Features Into HMM-Based Parametric Speech Synthesis [J] . Zhen-Hua Ling, Richmond K., Yamagishi J., Audio, Speech, and Language Processing, IEEE Transactions on . 2009,第6期

机译：将发音特征集成到基于HMM的参数语音合成中
3. Articulatory feature based continuous speech recognition using probabilistic lexical modeling [J] . Ramya Rasipuram, Mathew Magimai.-Doss Computer speech and language . 2016,第Mara期

机译：基于发音特征的概率词汇建模的连续语音识别
4. Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech [C] . Shaoguang Mao, Zhiyong Wu, Xu Li, IEEE International Conference on Multimedia and Expo . 2018

机译：将铰接性特征集成到声学模型中，用于L2英语演讲中的错误发正检测和诊断
5. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D] . Prabhavalkar, Rohit. 2013

机译：基于区分性发音特征的语音模型及其在口语检测中的应用
6. Deep Learning-Based Detection of Articulatory Features in Arabic and English Speech [O] . Mohammed Algabri, Hassan Mathkour, Mansour M. Alsulaiman, 2021

机译：基于深入的学习的阿拉伯语和英语演讲中的明晰度特征检测
7. Mispronunciation Detection and Diagnosis in L2 English Speech Using Multi-Distribution Deep Neural Networks [O] . Kun Li, Helen Meng 2015

机译：基于多分布深度神经网络的L2英语语音错误发音检测与诊断
8. Speech Recognition, Articulatory Feature Detection, and Speech Synthesis in Multiple Languages [R] . Ore, B. M. 2009

机译：语音识别，发音特征检测和多语言语音合成

Integrating Articulatory Features into Acoustic-Phonemic Model for Mispronunciation Detection and Diagnosis in L2 English Speech

摘要

著录项

相似文献

相关主题

期刊订阅