首页> 外国专利> THE METHOD AND APPARATUS FOR GENERATING EXTENDABLE CFG TYPE VOICE RECOGNITION GRAMMAR BASED ON CORPUS

THE METHOD AND APPARATUS FOR GENERATING EXTENDABLE CFG TYPE VOICE RECOGNITION GRAMMAR BASED ON CORPUS

机译:基于语料库的扩展的cfg类型语音识别语法生成方法及装置

摘要

A method and an apparatus for generating an extendable CFG-type voice recognition grammar based on a corpus are provided to describe and extend a CFG-type voice recognition grammar even when the corpus is small to perform continuous voice recognition in a specific area, thereby improving the accuracy and efficiency of the voice recognition. A method for generating an extendable CFG(Context-Free Grammar)-type voice recognition grammar based on a corpus comprises the following steps of: converting the corpus into a CFG-type voice recognition grammar pattern by using thesaurus or converting rules(S200); adding at least one of language used in a conversational style, low-ranking words included in a thesaurus, words used in a corresponding voice recognition area, and synonyms of declinable words to the CFG-type voice recognition grammar pattern to extend the CFG-type voice recognition grammar pattern(S300); and removing impossible to express in meaning in the extended CFG-type voice recognition grammar pattern(S400).
机译:提供了一种用于基于语料库生成可扩展的CFG型语音识别语法的方法和装置,以描述和扩展即使在语料库很小的情况下也可以在特定区域执行连续语音识别的CFG型语音识别语法,从而改善了语音识别的准确性和效率。一种基于语料库的可扩展CFG(Context-Free Grammar)型语音识别语法的生成方法,包括以下步骤:利用词库或转换规则将语料库转换为CFG型语音识别语法模式(S200);在CFG类型的语音识别语法模式中添加至少一种用于会话风格的语言,同义词库中包含的低排名单词,在相应的语音识别区域中使用的单词以及可拒绝单词的同义词以扩展CFG类型语音识别语法模式(S300);并消除扩展的CFG型语音识别语法模式中无法表达的意思(S400)。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号