首页> 外文会议>International Conference on speech and computer >Adaptation Approaches for Pronunciation Scoring with Sparse Training Data
【24h】

Adaptation Approaches for Pronunciation Scoring with Sparse Training Data

机译:带有稀疏训练数据的语音评分的自适应方法

获取原文

摘要

In Computer Assisted Language Learning systems, pronunciation scoring consists in providing a score grading the overall pronunciation quality of the speech uttered by a student. In this work, a log-likelihood ratio obtained with respect to two automatic speech recognition (ASR) models was used as score. One model represents native pronunciation while the other one captures non-native pronunciation. Different approaches to obtain each model and different amounts of training data were analyzed. The best results were obtained training an ASR system using a separate large corpus without pronunciation quality annotations and then adapting it to the native and non-native data, sequentially. Nevertheless, when models are trained directly on the native and non-native data, pronunciation scoring performance is similar. This is a surprising result considering that word error rates for these models are significantly worse, indicating that ASR performance is not a good predictor of pronunciation scoring performance on this system.
机译:在计算机辅助语言学习系统中,发音评分包括提供分数,以对学生说出的语音的整体发音质量进行评分。在这项工作中,使用相对于两个自动语音识别(ASR)模型获得的对数似然比作为得分。一种模型表示母语发音,而另一种则捕获非母语发音。分析了获得每种模型的不同方法和不同数量的训练数据。通过使用没有语音质量注释的单独大型语料库训练ASR系统,然后依次使其适应本地和非本地数据,可以获得最佳结果。但是,当直接在本地和非本地数据上训练模型时,语音评分性能是相似的。考虑到这些模型的单词错误率明显更差,这是一个令人惊讶的结果,表明ASR性能不是该系统上语音评分性能的良好预测指标。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号