SED-MDD: Towards Sentence Dependent End-To-End Mispronunciation Detection and Diagnosis

机译：SED-MDD：依赖于句子的端到端错误诊断和诊断

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

A mispronunciation detection and diagnosis (MD&D) system typically consists of multiple stages, such as an acoustic model, a language model and a Viterbi decoder. In order to integrate these stages, we propose SED-MDD, an end-to-end model for sentence dependent mispronunciation detection and diagnosis (MD&D) . Our proposed model takes mel-spectrogram and characters as inputs and outputs the corresponding phone sequence. Our experiments prove that SED-MDD can implicitly learn the phonological rules in both acoustic and linguistic features directly from the phonological annotation and transcription in the training data. To the best of our knowledge, SED-MDD is the first model of its kind and it achieves an accuracy of 86.35% and a correctness of 88.61% on L2-ARCTIC which significantly outperforms the existing end-to-end mispronunciation detection and diagnosis (MD&D) model CNN-RNN-CTC.

机译：错误发音检测和诊断（MD＆D）系统通常由多个阶段组成，例如声学模型，语言模型和维特比解码器。为了整合这些阶段，我们提出SED-MDD，用于句子依赖性错误发布检测和诊断的端到端模型（MD＆D）。我们所提出的模型将MEL-谱图和字符作为输入输出并输出相应的电话序列。我们的实验证明，SED-MDD可以直接从训练数据中的语音注释和转录中含蓄地学习声学和语言特征中的语音规则。据我们所知，SED-MDD是它的第一个模型，它的准确性为86.35％，正确性为88.61％，在L2-arctic上具有88.61％，显着优于现有的端到最终的误用检测和诊断（ MD＆D）模型CNN-RNN-CTC。

著录项

来源
《IEEE International Conference on Acoustics, Speech and Signal Processing》|2020年|3492-3496|共5页
会议地点
作者
Yiqing Feng; Guanyu Fu; Qingcai Chen; Kai Chen;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Computer Assisted Pronunciation Training (CAPT); Mispronunciation Detection and Diagnosis (MDD); L2 English speech; phone recognition; end-to-end model;

机译：计算机辅助语音训练（CAPT）;误音检测与诊断（MDD）; L2英语语音;电话识别;端到端模型;

相似文献

外文文献
中文文献
专利

1. Efficient personalized mispronunciation detection of Taiwanese-accented English speech based on unsupervised model adaptation and dynamic sentence selection [J] . Chung-Hsien Wu, Hung-Yu Su, Chao-Hong Liu Computer assisted language learning . 2013,第5期

机译：基于无监督模型自适应和动态句子选择的有效的个性化英语口音发音
2. A Two-Pass Framework of Mispronunciation Detection and Diagnosis for Computer-Aided Pronunciation Training [J] . Xiaojun Qian, Helen Meng, Frank Soong Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2016,第6期

机译：计算机辅助语音训练的双误检测和诊断的两阶段框架
3. End-to-End AI-Based Point-of-Care Diagnosis System for Classifying Respiratory Illnesses and Early Detection of COVID-19: A Theoretical Framework [J] . Abdelkader Nasreddine Belkacem, Sofia Ouhbi, Abderrahmane Lakas, Frontiers in Medicine . 2021,第a期

机译：基于端到端的AI的护理点诊断系统，用于分类呼吸道疾病和Covid-19的早期检测：理论框架
4. SED-MDD: Towards Sentence Dependent End-To-End Mispronunciation Detection and Diagnosis [C] . Yiqing Feng, Guanyu Fu, Qingcai Chen, IEEE International Conference on Acoustics, Speech and Signal Processing . 2020

机译：SED-MDD：对句子依赖的端到端错乱检测和诊断
5. Mispronunciation detection for language learning and speech recognition adaptation. [D] . Ge, Zhenhao. 2013

机译：错误识别检测用于语言学习和语音识别适应。
6. Fully end-to-end deep-learning-based diagnosis of pancreatic tumors [O] . Ke Si, Ying Xue, Xiazhen Yu, 2021

机译：完全结束的基于深度学习的胰腺肿瘤诊断
7. Phonological Feature Based Mispronunciation Detection and Diagnosis Using Multi-Task DNNs and Active Learning [O] . Vipul Arora, Aditi Lahiri, Henning Reetz 2017

机译：基于音韵的特征错通检测和诊断使用多任务DNN和主动学习

SED-MDD: Towards Sentence Dependent End-To-End Mispronunciation Detection and Diagnosis

摘要

著录项

相似文献

相关主题

期刊订阅