Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

Fengpei GE; Changliang LIU; Jian SHAO; Fuping PAN; Bin DONG; Yonghong YAN

首页> 外文期刊>IEICE Transactions on Information and Systems >Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

【24h】

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

机译：针对重音普通话语音质量得分的有效声学建模

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present our investigation into improving the performance of our computer-assisted language learning (CALL) system through exploiting the acoustic model and features within the speech recognition framework. First, to alleviate channel distortion, speaker-dependent cepstrum mean normalization (CMN) is adopted and the average correlation coefficient (average CC) between machine and expert scores is improved from 78.00% to 84.14%. Second, heteroscedastic linear discriminant analysis (HLDA) is adopted to enhance the discriminability of the acoustic model, which successfully increases the average CC from 84.14% to 84.62%. Additionally, HLDA causes the scoring accuracy to be more stable at various pronunciation proficiency levels, and thus leads to an increase in the speaker correct-rank rate from 85.59% to 90.99%. Finally, we use maximum a posteriori (MAP) estimation to tune the acoustic model to fit strongly accented test speech. As a result, the average CC is improved from 84.62% to 86.57%. These three novel techniques improve the accuracy of evaluating pronunciation quality.

机译：在本文中，我们将通过利用语音识别框架内的声学模型和功能，来提高计算机辅助语言学习（CALL）系统的性能。首先，为了减轻声道失真，采用了说话人相关的倒谱平均归一化（CMN），并且机器和专家评分之间的平均相关系数（平均CC）从78.00％提高到84.14％。其次，采用异方差线性判别分析（HLDA）来增强声学模型的可判别性，从而成功地将平均CC从84.14％提高到84.62％。另外，HLDA使得评分准确性在各种发音水平上更加稳定，因此导致说话人正确评级率从85.59％增加到90.99％。最后，我们使用最大后验（MAP）估计来调整声学模型以适合重音测试语音。结果，平均CC从84.62％提高到86.57％。这三种新颖的技术提高了评估语音质量的准确性。

著录项

来源
《IEICE Transactions on Information and Systems》 |2008年第10期|p.2485-2492|共8页
作者
Fengpei GE; Changliang LIU; Jian SHAO; Fuping PAN; Bin DONG; Yonghong YAN;
展开▼
作者单位

ThinkIT Speech Lab., China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类无线电电子学、电信技术;
关键词
CALL; speech recognition; HLDA; speaker-dependent CMN; e-learning;

机译：CALL;语音识别;HLDA;与说话者相关的CMN;在线学习;

相似文献

外文文献
中文文献
专利

1. Pronunciation Modeling for Spontaneous Mandarin Speech Recognition [J] . YI LIU, PASCALE FUNG International journal of speech technology . 2004,第2a3期

机译：自发普通话语音识别的语音建模
2. Modeling partial pronunciation variations for spontaneous Mandarin speech recognition [J] . Yi Liu, Pascale Fung Computer speech and language . 2003,第4期

机译：为自发普通话语音识别建模部分发音变化
3. Multilingual recognition of non-native speech using acoustic model transformation and pronunciation modeling [J] . G. Bouselmi, D. Fohr, I. Illina International journal of speech technology . 2012,第2期

机译：使用声学模型转换和语音建模对非母语语音进行多语言识别
4. Some Acoustic Improvements for Pronunciation Quality Assessment for Strongly Accented Mandarin Speech [C] . Fengpei Ge, Fuping Pan, Changliang Liu, 2008 International Conference on Audio，Language and Image Processing（2008国际声音、语言、图像过程大会）论文集 . 2008

机译：强音普通话语音质量评估中的一些声学改进
5. Pronunciation modeling for spontaneous Mandarin speech recognition. [D] . Liu, Yi. 2002

机译：用于自发普通话语音识别的语音建模。
6. Listening with a foreign-accent: The interlanguage speech intelligibility benefit in Mandarin speakers of English [O] . Xin Xie, Carol A. Fowler -1

机译：带有异味的听力：讲普通话的英语者的中介语语音清晰度
7. PARTIAL CHANGE ACCENT MODELS FOR ACCENTED MANDARIN SPEECH RECOGNITION [O] . Liu Yi, Pascale Fung 2010

机译：重读普通话语音识别的部分更改重音模型

Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech

摘要

著录项

相似文献

相关主题

期刊订阅