首页> 外文会议> >Large vocabulary continuous Mandarin speech recognition using finite state machine
【24h】

Large vocabulary continuous Mandarin speech recognition using finite state machine

机译:基于有限状态机的大词汇量连续汉语语音识别

获取原文

摘要

The finite state transducer (FST), popularly used in the natural language processing (NLP) area to represent the grammar rules and the characteristics of a language, has been extensively used as the core in large vocabulary continuous speech recognition (LVCSR) in recent years. By means of FST, we can effectively compose the acoustic model, pronunciation lexicon, and language model to form a compact search space. In this paper, we present our approach to developing a LVCSR decoder using FST as the core. In addition, the traditional one-pass tree-copy search algorithm is also described for comparison in terms of speed, memory requirements and achieved character accuracy.
机译:有限状态换能器(FST)广泛用于自然语言处理(NLP)领域,以表示语法规则和语言特征,近年来已广泛用作大词汇量连续语音识别(LVCSR)的核心。通过FST,我们可以有效地组合声学模型,发音词典和语言模型,以形成紧凑的搜索空间。在本文中,我们介绍了以FST为核心开发LVCSR解码器的方法。此外,还描述了传统的一遍树状副本搜索算法,以进行速度,内存需求和已实现字符精度方面的比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号