首页> 外文期刊>IEICE Transactions on Information and Systems >Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech
【24h】

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

机译:针对重音普通话语音质量得分的有效声学建模

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper we present our investigation into improving the performance of our computer-assisted language learning (CALL) system through exploiting the acoustic model and features within the speech recognition framework. First, to alleviate channel distortion, speaker-dependent cepstrum mean normalization (CMN) is adopted and the average correlation coefficient (average CC) between machine and expert scores is improved from 78.00% to 84.14%. Second, heteroscedastic linear discriminant analysis (HLDA) is adopted to enhance the discriminability of the acoustic model, which successfully increases the average CC from 84.14% to 84.62%. Additionally, HLDA causes the scoring accuracy to be more stable at various pronunciation proficiency levels, and thus leads to an increase in the speaker correct-rank rate from 85.59% to 90.99%. Finally, we use maximum a posteriori (MAP) estimation to tune the acoustic model to fit strongly accented test speech. As a result, the average CC is improved from 84.62% to 86.57%. These three novel techniques improve the accuracy of evaluating pronunciation quality.
机译:在本文中,我们将通过利用语音识别框架内的声学模型和功能,来提高计算机辅助语言学习(CALL)系统的性能。首先,为了减轻声道失真,采用了说话人相关的倒谱平均归一化(CMN),并且机器和专家评分之间的平均相关系数(平均CC)从78.00%提高到84.14%。其次,采用异方差线性判别分析(HLDA)来增强声学模型的可判别性,从而成功地将平均CC从84.14%提高到84.62%。另外,HLDA使得评分准确性在各种发音水平上更加稳定,因此导致说话人正确评级率从85.59%增加到90.99%。最后,我们使用最大后验(MAP)估计来调整声学模型以适合重音测试语音。结果,平均CC从84.62%提高到86.57%。这三种新颖的技术提高了评估语音质量的准确性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号