首页> 外文会议>International Conference on Natural Language Processing and Knowledge Engineering >HMM-based Phonemic Distance in Different Speaking Styles and Its Influence on Substitutions in Mandarin Speech Recognition
【24h】

HMM-based Phonemic Distance in Different Speaking Styles and Its Influence on Substitutions in Mandarin Speech Recognition

机译:基于HMM的音学距离不同的说话方式及其对普通话语音识别替换的影响

获取原文

摘要

Statistical confusability between different acoustic models is important to character substitution error rate in large vocabulary continuous speech recognition. In this paper, we take factors of gender and speaking styles into consideration in Mandarin speech recognition. We modeled phonemes in different speaking styles, including read speech of female, male, and spontaneous dialogue. Then Minimum Gaussian Distances between Chinese Initial/Final model pairs are given and average phoneme distances are calculated which denote the pronunciation varieties. The effect of different style to average phonemic distance is studied and relative articulation is given for three databases. Qualitative relationship between phone size and error rate in recognition is analytical researched, showing that for a particular phoneme, pronunciation variety is one of reasons for misidentification in recognizing process, which provides us a novel mind to reduce substitution errors.
机译:不同声学模型之间的统计混淆对于大型词汇连续语音识别中的字符替代错误率是重要的。在本文中,我们在普通话语音识别中考虑了性别和说话方式的因素。我们以不同的说话方式建模了音素,包括读取女性,男性和自发对话的读物。然后给出了汉语初始/最终模型对之间的最小高斯距离,并计算了平均音素距离,其表示发音。研究了不同风格到平均音素距离的影响,并给出了三​​个数据库的相对关节。电话尺寸与识别中错误率之间的定性关系是分析的,显示出于特定的音素,发音多是识别过程中错误识别的原因之一,这为我们提供了一种新颖的思想来减少替代错误。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号