首页> 外国专利> TEXT PREDICTION MODEL TRAINING METHOD AND APPARATUS

TEXT PREDICTION MODEL TRAINING METHOD AND APPARATUS

机译:文本预测模型训练方法和装置

摘要

Disclosed are a text prediction model training method executed by a computer, and a text prediction model training apparatus. A text prediction model comprises a first prediction network (11) based on a time sequence, a buffer (12), and a second prediction network (13) based on the buffer (12). The training method comprises: inputting a t-th word in training text into a first prediction network (11), such that the first prediction network determines a first prediction probability for the next word according a state vector obtained by means of time sequence processing; in addition, reading, from a buffer (12), several fragment vectors formed on the basis of the previous text, and a second prediction network (13) obtaining a second prediction probability for the next word according to these fragment vectors; then, by taking an interpolation weight coefficient λ as a weighting coefficient of the second prediction probability, and taking one minus λ as a weighting coefficient of the first prediction probability, weighing and synthesizing the second prediction probability and the first prediction probability in order to obtain a comprehensive prediction probability; and at least according to the comprehensive prediction probability and a (t+1)th word, determining a prediction loss regarding the t-th word, and thereby training a text prediction model.
机译:本发明公开了一种由计算机执行文本预测模型训练方法,和一个文本预测模型训练装置。文本预测模型包括基于时间序列的第一预测网络(11),缓冲器(12),并基于该缓冲器(12)的第二预测网络(13)。该训练方法包括:输入在训练文本第t个字转换成第一预测网络(11),使得第一预测网络确定用于根据由时间序列处理的手段获得的状态向量的下一个单词的第一预测概率;此外,阅读,从缓冲器(12),形成在前文的基础上几个片段的载体,并获得用于根据这些片段的载体的下一个单词的第二预测概率的第二预测网络(13);然后,通过以获得取内插权重系数λ作为第二预测概率的加权系数,和采取一个减去λ作为第一预测概率的加权系数,称重和合成第二预测概率和所述第一预测概率全面的预测概率;和至少根据所述综合预测概率和(T + 1)个字,确定关于第t个字的预测损失,由此训练文本预测模型。

著录项

  • 公开/公告号WO2021155705A1

    专利类型

  • 公开/公告日2021-08-12

    原文格式PDF

  • 申请/专利号WO2020CN132617

  • 发明设计人 LI YANGMING;YAO KAISHENG;

    申请日2020-11-30

  • 分类号G06F40/211;G06F40/216;G06F40/284;G06N3/04;G06N3/08;

  • 国家 CN

  • 入库时间 2022-08-24 20:36:33

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号