首页> 外文会议>Iberoamerican Congress on Pattern Recognition >Statistical and Linguistic Clustering for Language Modeling in ASR
【24h】

Statistical and Linguistic Clustering for Language Modeling in ASR

机译:ASR语言建模的统计和语言聚类

获取原文

摘要

In this work several sets of categories obtained by a statistical clustering algorithm, as well as a linguistic set, were used to design category-based language models. The language models proposed were evaluated, as usual, in terms of perplexity of the text corpus. Then they were integrated into an ASR system and also evaluated in terms of system performance. It can be seen that category-based language models can perform better, also in terms of WER, when categories are obtained through statistical models instead of using linguistic techniques. They also show that better system performance are obtained when the language model interpolates category based and word based models.
机译:在这项工作中,通过统计聚类算法以及语言集获得了几组类别,用于设计基于类别的语言模型。在文本语料库的困惑方面,提出的语言模型被评估。然后它们被集成到ASR系统中,并在系统性能方面进行评估。可以看出,基于类别的语言模型也可以更好地执行,而且在WER方面,当通过统计模型而不是使用语言技术获得类别时。他们还表明,当语言模型内插基于基于词的模型时,获得了更好的系统性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号