首页> 外国专利> METHOD AND APPARATUS FOR EXPANSION OF SEARCH QUERIES ON LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION TRANSCRIPTS

METHOD AND APPARATUS FOR EXPANSION OF SEARCH QUERIES ON LARGE VOCABULARY CONTINUOUS SPEECH RECOGNITION TRANSCRIPTS

机译:扩展大型词汇连续语音识别记录中搜索查询的方法和装置

摘要

The subject matter discloses a method for expansion of search queries on large vocabulary continuous speech recognition transcripts comprising: obtaining a textual transcript of audio interaction generated by the large vocabulary continuous speech recognition; generating a topic model from the textual transcripts; said topic model comprises a plurality of topics wherein each topic of the plurality of topics comprises a list of keywords; obtaining a search term; associating a topic from the topic model with the search term; and generating a list of candidate term expansion words by selecting keywords from the list of keywords of the associated topic; said candidate term expansion words are of high probability to be substitution errors of the search term that are generated by the large vocabulary continuous speech recognition.
机译:该主题公开了一种用于扩展对大词汇量连续语音识别转录本的搜索查询的方法,包括:获得由大词汇量连续语音识别生成的音频交互的文本转录本;以及根据文字记录生成主题模型;所述主题模型包括多个主题,其中,多个主题中的每个主题包括关键字列表;获取搜索词;将主题模型中的主题与搜索词相关联;通过从相关主题的关键词列表中选择关键词,生成候选词扩展词列表;所述候选词扩展词很可能是由大词汇量连续语音识别产生的搜索词的替换错误。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号