Conference: International Speech Communication Association

Phonetic Confusion Analysis and Robust Phone Set Generation for Shanghai-Accented Mandarin Speech Recognition

Abstract

This paper discusses accent issues in Shanghai-accented Mandarin speech recognition. Phonetic confusion is analyzed in detail based on the alignment between the surface-form and baseform transcriptions. Mutual information is used as the measure for extracting the most confusable phoneme pairs. The analysis found that each phoneme in such a pair is easily misrecognized as the other. To remove this confusion, it is better to replace the two phonemes in a pair with a single newly generated one; new phone sets are derived accordingly. Both the phonetic confusion analysis and the experimental evaluation are performed on a Shanghai-accented Mandarin speech corpus. Experimental results show that, compared with the canonical phone set, the generated phone set greatly reduces substitution errors and achieves a 0.72% absolute reduction in Chinese character error rate (CER). When combined with pronunciation modeling, the absolute CER reduction is 1.58%.
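The abstract does not give the exact formula or merging procedure, so the following is only a minimal sketch of the general idea, under stated assumptions. It assumes the alignment has already produced a list of (baseform phoneme, surface phoneme) pairs, scores each unordered pair of distinct phonemes by its summed contribution to the mutual information between baseform and surface labels, and then maps the top-ranked pairs to a merged unit. The function names `confusion_scores` and `merge_pairs`, the `a_b` naming of merged units, and the rule that a phoneme joins at most one merged unit are all hypothetical choices made for illustration, not details taken from the paper.

```python
import math
from collections import Counter


def confusion_scores(aligned_pairs):
    """Rank unordered phoneme pairs by their contribution to the mutual
    information between baseform and surface labels.

    aligned_pairs: list of (baseform_phoneme, surface_phoneme) tuples.
    Assumption: the score sums p(b, s) * log2(p(b, s) / (p(b) * p(s)))
    over both directions of a confusion (b -> s and s -> b).
    """
    joint = Counter(aligned_pairs)                  # counts of (baseform, surface)
    total = sum(joint.values())
    base = Counter(b for b, _ in aligned_pairs)     # baseform marginal counts
    surf = Counter(s for _, s in aligned_pairs)     # surface marginal counts

    scores = {}
    for (b, s), n in joint.items():
        if b == s:
            continue                                # correct realizations are not confusions
        p_bs = n / total
        pmi = math.log2(p_bs / ((base[b] / total) * (surf[s] / total)))
        key = tuple(sorted((b, s)))                 # treat (b, s) and (s, b) as one pair
        scores[key] = scores.get(key, 0.0) + p_bs * pmi
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


def merge_pairs(phone_set, pairs):
    """Map each phoneme to itself or to a merged unit shared with its partner.

    Assumption: a phoneme joins at most one merged unit; later pairs that
    reuse an already merged phoneme are skipped.
    """
    mapping = {p: p for p in phone_set}
    merged_already = set()
    for a, b in pairs:
        if a in merged_already or b in merged_already:
            continue
        new_unit = f"{a}_{b}"                       # hypothetical naming scheme
        mapping[a] = mapping[b] = new_unit
        merged_already.update((a, b))
    return mapping
```

In use, one would pass the aligned pairs to `confusion_scores`, keep the top few pairs, and apply the mapping from `merge_pairs` to the lexicon and training transcriptions before retraining the acoustic models. How many pairs to merge is a tuning decision that this sketch leaves open.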
