Pronunciation Modeling With Reduced Confusion for Mandarin Chinese Using a Three-Stage Framework

Tsai M.-Y.; Chou F.-C.; Lee L.-S.

首页> 外文期刊>IEEE transactions on audio, speech and language processing >Pronunciation Modeling With Reduced Confusion for Mandarin Chinese Using a Three-Stage Framework

【24h】

Pronunciation Modeling With Reduced Confusion for Mandarin Chinese Using a Three-Stage Framework

机译：使用三阶段框架的汉语普通话语音建模，减少了混乱

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Multiple-pronunciation dictionaries have been found to be useful in pronunciation modeling for speech recognition. However, the extra pronunciation variants added in the dictionary inevitably increase the confusion among different words during recognition, and consequently limit the achievable improvements in the recognition performance. This paper proposes a three-stage framework for Mandarin Chinese to construct automatically the multiple-pronunciation dictionary while reducing the possible confusion caused. The proposed framework includes pronunciation generation (Stage 1), ranking (Stage 2) and pruning (Stage3). New measures of confusability for multiple-pronunciation dictionaries were developed and shown to have a very strong correlation with recognition performance. With the proposed framework, it was shown that the confusability as measured can be reduced and recognition performance improved stage by stage. All of the above findings were verified by a series of experiments performed on both planned (LDC HUB-4NE) and spontaneous (LDC CALLHOME) Mandarin Chinese speech corpora

机译：已经发现，多发音词典在语音识别的语音建模中很有用。但是，词典中添加的额外发音变体不可避免地增加了识别过程中不同单词之间的混乱，因此限制了可实现的识别性能改进。本文提出了一个三阶段的框架，用于汉语普通话自动构建多重发音词典，同时减少可能引起的混淆。提议的框架包括发音生成（阶段1），排名（阶段2）和修剪（阶段3）。制定了多种发音词典的易混淆性新指标，并显示出与识别性能非常相关。利用所提出的框架，可以逐步降低所测量的可混淆性并提高识别性能。以上所有发现均通过在计划中（LDC HUB-4NE）和自发（LDC CALLHOME）汉语普通话语料库上进行的一系列实验得到了验证。

著录项

来源
《IEEE transactions on audio, speech and language processing》 |2007年第2期|p.661-675|共15页
作者
Tsai M.-Y.; Chou F.-C.; Lee L.-S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类自动化技术、计算机技术;
关键词
natural languages; speech recognition; Mandarin Chinese; multiple-pronunciation dictionary; pronunciation generation; pronunciation modeling; speech recognition; three-stage framework; Confusability; confusion; multiple-pronunciation dictionary; pronunciation model;

机译：自然语言;语音识别;汉语普通话;多发音词典;发音生成;发音建模;语音识别;三阶段框架;易混淆性;困惑;多发音词典;发音模型;

相似文献

外文文献
中文文献
专利

1. Mixed Models Based Pronunciation Evaluation of Mandarin Tone [J] . Zhang Long, Li Haifeng, Ma Lin, Journal of Multimedia . 2013,第6期

机译：基于混合模型的普通话语音评价
2. Effective Acoustic Modeling for Pronunciation Quality Scoring of Strongly Accented Mandarin Speech [J] . Fengpei GE, Changliang LIU, Jian SHAO, IEICE Transactions on Information and Systems . 2008,第10期

机译：针对重音普通话语音质量得分的有效声学建模
3. Pronunciation Modeling for Spontaneous Mandarin Speech Recognition [J] . YI LIU, PASCALE FUNG International journal of speech technology . 2004,第2a3期

机译：自发普通话语音识别的语音建模
4. An Application of Modified Confusion Network for Improving Mispronunciation Detection in Computer-aided Mandarin Pronunciation Training [C] . Jun Qi, Ruiying Wei, Runsheng Liu Asia-Pacific Signal and Information Processing Association Annual Summit and Conference . 2011

机译：改进的混淆网络在错误辅助检测中的应用
5. How Does the Pronunciation of Native Languages Affect Beginning Singers? A Research Focusing on Native Mandarin Chinese and American English Speaking Singers [D] . Zhao, Ruobing. 2019

机译：母语如何影响开始歌手？专注于本土普通话和美国英语歌手的研究
6. A Tutoring Package to Teach Pronunciation of Mandarin Chinese Characters [O] . Hang Wu, L Keith Miller 1997

机译：教授普通话发音的辅导包

Pronunciation Modeling With Reduced Confusion for Mandarin Chinese Using a Three-Stage Framework

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅