LANGUAGE MODEL TRAINED USING PREDICTED QUERIES FROM STATISTICAL MACHINE TRANSLATION

Abstract

A Statistical Machine Translation (SMT) model (165) is trained using pairs of sentences that match content obtained from one or more content sources (e.g., feeds) with corresponding queries that have been used to access that content. A query click graph (130) may be used to help determine candidate pairs for the SMT training data. All or a portion of the candidate pairs may be used to train the SMT model. After training on the SMT training data, the SMT model is applied to content to determine predicted queries (154) that may be used to search for the content. The predicted queries are used to train a language model, such as a query language model. The query language model may be interpolated with other language models, such as a background language model and a feed language model trained on the content used in determining the predicted queries.
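
The final step of the abstract, interpolating the query language model with a feed language model and a background language model, can be illustrated with a minimal sketch. The abstract does not say how the interpolation is performed, so this assumes plain linear interpolation, P(w) = λ_q·P_query(w) + λ_f·P_feed(w) + λ_b·P_background(w), over add-alpha smoothed unigram models. The function names, the toy sentences, and the weights are all hypothetical and chosen only for illustration; they are not taken from the patent.

```python
from collections import Counter


def train_unigram_lm(sentences, vocab, alpha=1.0):
    """Return add-alpha smoothed unigram probabilities over a fixed vocabulary."""
    counts = Counter(tok for s in sentences for tok in s.lower().split())
    total = sum(counts[w] for w in vocab) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def interpolate(models, weights):
    """Linearly interpolate models that share one vocabulary."""
    if abs(sum(weights) - 1.0) > 1e-9:
        raise ValueError("interpolation weights must sum to 1")
    vocab = models[0].keys()
    return {w: sum(lam * m[w] for lam, m in zip(weights, models)) for w in vocab}


# Toy, made-up data standing in for the three corpora named in the abstract.
predicted_queries = ["weather seattle today", "seattle weather forecast"]
feed_content = ["rain is expected in seattle today"]
background_text = ["the forecast is in the news today"]

# Shared vocabulary so that every model assigns mass to the same words.
vocab = sorted({tok
                for s in predicted_queries + feed_content + background_text
                for tok in s.lower().split()})

query_lm = train_unigram_lm(predicted_queries, vocab)
feed_lm = train_unigram_lm(feed_content, vocab)
background_lm = train_unigram_lm(background_text, vocab)

# Arbitrary weights (assumption); in practice they would be tuned on held-out data.
mixed_lm = interpolate([query_lm, feed_lm, background_lm], [0.5, 0.3, 0.2])
```

Because each component model is a proper distribution over the same vocabulary and the weights sum to 1, the interpolated model is also a proper distribution. Words that are frequent in the predicted queries (here, "weather") receive more probability in the mixture than they would from the background model alone.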

Bibliographic details

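  • Publication number: WO2014190220A3
  • Patent type:
  • Publication date: 2015-05-14
  • Original format: PDF
  • Applicant/patentee: MICROSOFT CORPORATION
  • Application number: WO2014US39258
  • Filing date: 2014-05-23
  • Classification: G06F17/27; G10L15/197; G06F17/30; G10L15/06; G06F17/28
  • Country: WO
  • Date added to database: 2022-08-21 15:09:56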
