
Method and apparatus for large vocabulary continuous speech recognition based on dynamic addition of vocabulary to be recognized


Abstract

The present invention proposes a method and apparatus for large vocabulary continuous speech recognition that improve the recognition rate of a speech recognition system on multimedia data such as broadcasts, conference recordings, and call center conversation recordings, by letting users supply and add words or strings that are expected to be frequently misrecognized. According to an aspect of the present invention, a method for large vocabulary continuous speech recognition based on dynamic addition of vocabulary to be recognized comprises, in a continuous speech recognition system: applying 'BLANK' vocabulary entries to a language model in advance to construct a language model capable of extending its vocabulary (a vocabulary-extended language model); receiving a user-defined word or phrase from the user as an input to the speech recognition system, and inserting the user-defined word or phrase into a 'BLANK' vocabulary entry previously prepared in the search space of the vocabulary-extended language model; and receiving the user's spoken speech and performing a decoding process on that speech.
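The steps in the abstract can be illustrated with a toy sketch. It is not the patented implementation: the class name `ExtensibleVocabulary`, the slot names `BLANK_0`, `BLANK_1`, the unigram probabilities, and the flat placeholder probability are all hypothetical, and a simple unigram score stands in for the decoder. A real system would insert the new word into the decoder's search network rather than a Python dictionary.

```python
import math


class ExtensibleVocabulary:
    """Toy language model that reserves placeholder entries in advance."""

    def __init__(self, unigram_probs, num_blank_slots=2, blank_prob=1e-4):
        # Base vocabulary with assumed (illustrative) unigram probabilities.
        self.log_probs = {w: math.log(p) for w, p in unigram_probs.items()}
        # Step 1: reserve BLANK entries in the model before any user input.
        self.slots = {}
        for i in range(num_blank_slots):
            name = f"BLANK_{i}"
            self.slots[name] = None
            self.log_probs[name] = math.log(blank_prob)

    def add_user_word(self, word):
        """Step 2: bind a user-defined word to the first free BLANK entry."""
        for name, occupant in self.slots.items():
            if occupant is None:
                self.slots[name] = word
                return name
        raise RuntimeError("no free BLANK entry left")

    def token_for(self, word):
        """Map a surface word to its model token, or None if it is unknown."""
        if word in self.log_probs and word not in self.slots:
            return word
        for name, occupant in self.slots.items():
            if occupant == word:
                return name
        return None

    def score(self, words):
        """Step 3 stand-in: log score of a hypothesis, -inf if any word is unknown."""
        total = 0.0
        for w in words:
            token = self.token_for(w)
            if token is None:
                return float("-inf")
            total += self.log_probs[token]
        return total


vocab = ExtensibleVocabulary({"call": 0.5, "the": 0.3, "center": 0.2})
before = vocab.score(["call", "acmecorp"])   # "acmecorp" is not yet recognizable
slot = vocab.add_user_word("acmecorp")       # takes over a reserved BLANK entry
after = vocab.score(["call", "acmecorp"])    # now scored without rebuilding the model
```

The point of the design is that the model's size and structure are fixed when it is built, so adding a word only fills a reserved entry instead of requiring the language model to be retrained or recompiled.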
