Articulatory Modeling for Pronunciation Error Detection without Non-Native Training Data Based on DNN Transfer Learning

Richeng DUAN; Tatsuya KAWAHARA; Masatake DANTSUJI; Jinsong ZHANG

首页> 外文期刊>IEICE transactions on information and systems >Articulatory Modeling for Pronunciation Error Detection without Non-Native Training Data Based on DNN Transfer Learning

【24h】

Articulatory Modeling for Pronunciation Error Detection without Non-Native Training Data Based on DNN Transfer Learning

机译：基于DNN传递学习的非母语训练数据语音错误检测的关节建模

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Aiming at detecting pronunciation errors produced by second language learners and providing corrective feedbacks related with articulation, we address effective articulatory models based on deep neural network (DNN). Articulatory attributes are defined for manner and place of articulation. In order to efficiently train these models of non-native speech without such data, which is difficult to collect in a large scale, several transfer learning based modeling methods are explored. We first investigate three closely-related secondary tasks which aim at effective learning of DNN articulatory models. We also propose to exploit large speech corpora of native and target language to model inter-language phenomena. This kind of transfer learning can provide a better feature representation of non-native speech. Related task transfer and language transfer learning are further combined on the network level. Compared with the conventional DNN which is used as the baseline, all proposed methods improved the performance. In the native attribute recognition task, the network-level combination method reduced the recognition error rate by more than 10% relative for all articulatory attributes. The method was also applied to pronunciation error detection in Mandarin Chinese pronunciation learning by Japanese native speakers, and achieved the relative improvement up to 17.0% for detection accuracy and up to 19.9% for F-score, which is also better than the lattice-based combination.

机译：为了检测第二语言学习者产生的发音错误并提供与发音有关的纠正反馈，我们针对基于深度神经网络（DNN）的有效发音模型。针对关节的方式和位置定义了关节属性。为了有效地训练这些没有大量数据而难以收集的非母语语音模型，探索了几种基于迁移学习的建模方法。我们首先调查三个紧密相关的次要任务，旨在有效学习DNN关节模型。我们还建议利用本地和目标语言的大型语音语料库来建模中间语言现象。这种转移学习可以提供更好的非母语语音特征表示。相关的任务转移和语言转移学习在网络级别上进一步组合。与用作基准的常规DNN相比，所有提出的方法均提高了性能。在本机属性识别任务中，相对于所有关节属性，网络级组合方法将识别错误率降低了10％以上。该方法还适用于日语为母语的汉语普通话发音学习中的语音错误检测，检测精度相对提高了17.0％，F评分相对提高了19.9％，也优于基于格的方法。组合。

著录项

来源
《IEICE transactions on information and systems》 |2017年第9期|共9页
作者
Richeng DUAN; Tatsuya KAWAHARA; Masatake DANTSUJI; Jinsong ZHANG;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词

相似文献

外文文献
中文文献
专利

1. Pronunciation Error Detection using DNN Articulatory Model based on Transfer Learning [J] . Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji 電子情報通信学会技術研究報告. 音声. Speech . 2016,第189期

机译：基于转移学习的DNN发音模型的语音错误检测
2. Cross-Lingual Transfer Learning of Non-Native Acoustic Modeling for Pronunciation Error Detection and Diagnosis [J] . Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji, Audio, Speech, and Language Processing, IEEE/ACM Transactions on . 2020,第期

机译：非本地声学建模的交叉语言转移学习发音错误检测和诊断
3. Phonological feature-based speech recognition system for pronunciation training in non-native language learning [J] . Arora Vipul, Lahiri Aditi, Reetz Henning The Journal of the Acoustical Society of America . 2018,第1期

机译：基于语音特征的语音识别系统，用于非母语学习中的发音培训
4. Effective articulatory modeling for pronunciation error detection of L2 learner without non-native training data [C] . Richeng Duan, Tatsuya Kawahara, Masatake Dantsuji, IEEE International Conference on Acoustics, Speech and Signal Processing . 2017

机译：在没有非本地训练数据的情况下，有效的发音建模可用于L2学习者的语音错误检测
5. Discriminative Articulatory Feature-based Pronunciation Models with Application to Spoken Term Detection [D] . Prabhavalkar, Rohit. 2013

机译：基于区分性发音特征的语音模型及其在口语检测中的应用
6. DNN-MVL: DNN-Multi-View-Learning-Based Recover Block Missing Data in a Dam Safety Monitoring System [O] . Yingchi Mao, Jianhua Zhang, Hai Qi, 2019

机译：DNN-MVL：大坝安全监控系统中基于DNN-多视图学习的恢复块丢失数据
7. Articulatory Modeling for Pronunciation Error Detection without Non-Native Training Data Based on DNN Transfer Learning [O] . Richeng DUAN, Tatsuya KAWAHARA, Masatake DANTSUJI, 2017

机译：没有基于DNN转移学习的非本机训练数据的发音错误检测的明确模型

Articulatory Modeling for Pronunciation Error Detection without Non-Native Training Data Based on DNN Transfer Learning

摘要

著录项

相似文献

相关主题

期刊订阅