Conference: Annual Pacific Voice Conference

Statistics of diphones and triphones presence on the word boundaries in the Polish language. Applications to ASR


Abstract

Recognition of continuous speech is one of the major challenges in automatic speech recognition (ASR), especially in phonetically complex languages (e.g. Polish). To improve ASR of the Polish language, we obtained phoneme statistics to locate diphones and triphones within running speech sequences. We found that these clusters are more likely to occur across word boundaries than within words. Our research identified the most frequently occurring diphones and triphones in a natural speech corpus (Corpora), and we normalized these data for the Polish language at large. The results can be used in various ASR application systems, e.g. by the speech recognizer module to enhance word boundary recognition, or to recognize non-dictionary words (e.g. proper names) embedded in a natural sentence.
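The abstract does not describe how the statistics were computed. As a minimal sketch only, the following Python shows one way phoneme n-grams (diphones for n = 2, triphones for n = 3) could be counted separately for those lying inside a word and those spanning a word boundary. The function name `ngram_boundary_counts`, the input layout (a sentence as a list of words, each word a list of phoneme symbols) and the toy transcription are assumptions made for illustration; they are not taken from the paper.

```python
from collections import Counter


def ngram_boundary_counts(sentence, n):
    """Count phoneme n-grams in one sentence, split by boundary status.

    `sentence` is a list of words; each word is a list of phoneme symbols
    (a hypothetical input layout, not the paper's data format).
    Returns two Counters: n-grams inside a single word, and n-grams that
    span at least one word boundary.
    """
    # Flatten to (phoneme, word_index) pairs so each phoneme remembers its word.
    flat = [(ph, w_idx) for w_idx, word in enumerate(sentence) for ph in word]
    within, across = Counter(), Counter()
    for i in range(len(flat) - n + 1):
        window = flat[i:i + n]
        gram = tuple(ph for ph, _ in window)
        # Word indices never decrease, so comparing the first and last
        # phoneme of the window is enough to detect a boundary crossing.
        if window[0][1] == window[-1][1]:
            within[gram] += 1
        else:
            across[gram] += 1
    return within, across


# Toy two-word utterance with an illustrative (assumed) phoneme transcription.
sentence = [["t", "a", "k"], ["s", "t", "o"]]
di_within, di_across = ngram_boundary_counts(sentence, 2)
tri_within, tri_across = ngram_boundary_counts(sentence, 3)
print("diphones across boundaries:", dict(di_across))
print("triphones across boundaries:", dict(tri_across))
```

Summing such counters over a whole corpus and dividing by the totals would give relative frequencies; how the authors normalized their data for Polish at large is not stated in the abstract.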
