METHOD AND APPARATUS FOR TRAINING LANGUAGE MODEL, ELECTRONIC DEVICE, READABLE STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT

Abstract

The present application discloses a method and an apparatus for training a language model, an electronic device, and a readable storage medium, and relates to the technical field of natural language processing in artificial intelligence. In one specific implementation, the language model is first pre-trained on preset text corpora from a corpus base. At least one word in a sample text corpus is then replaced with a word mask, yielding a sample text corpus that contains at least one word mask. The masked sample text corpus is input into the language model, which outputs a context vector for each of the at least one word mask. A word vector corresponding to each word mask is determined from that word mask's context vector and a word vector parameter matrix, and the language model is trained based on the word vectors corresponding to the word masks. Introducing semantic representations of a larger granularity (words rather than characters) strengthens the language model's ability to learn word-level semantic information, and effectively avoids the risk of information leakage that whole-word masking of characters may cause.
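The masking and word-prediction steps described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the `[WMASK]` token, all function names, and the toy vocabulary are hypothetical, and the abstract does not say how the context vector and the word vector parameter matrix are combined. The sketch assumes a dot product of the context vector with each row of the matrix, followed by a softmax and a cross-entropy loss. The language model itself is left out; its output context vectors are passed in directly.

```python
import math

MASK = "[WMASK]"  # hypothetical word-level mask token, not named in the source


def mask_words(words, positions):
    """Replace each whole word at the given positions with one word mask.

    Returns the masked sequence and a dict mapping position -> original word.
    """
    masked = list(words)
    targets = {}
    for pos in positions:
        targets[pos] = masked[pos]
        masked[pos] = MASK
    return masked, targets


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def word_distribution(context_vector, word_matrix):
    """Assumed combination: score each vocabulary word by the dot product of
    its row in the word vector parameter matrix with the context vector."""
    scores = [
        sum(w * c for w, c in zip(row, context_vector)) for row in word_matrix
    ]
    return softmax(scores)


def masked_word_loss(context_vectors, targets, word_matrix, vocab):
    """Average cross-entropy of the true words at the masked positions.

    context_vectors maps a masked position to the context vector that the
    language model (not shown here) would output for that word mask.
    """
    loss = 0.0
    for pos, word in targets.items():
        probs = word_distribution(context_vectors[pos], word_matrix)
        loss -= math.log(probs[vocab[word]])
    return loss / len(targets)


# Toy usage: two-word vocabulary, one row of the parameter matrix per word.
vocab = {"Harbin": 0, "capital": 1}
word_matrix = [[1.0, 0.0], [0.0, 1.0]]
masked, targets = mask_words(["Harbin", "is", "a", "capital"], [0])
# Stand-in for the model output at the masked position.
context_vectors = {0: [10.0, 0.0]}
loss = masked_word_loss(context_vectors, targets, word_matrix, vocab)
```

In a real training loop the loss would be backpropagated into both the language model and the word vector parameter matrix. Because the whole word is replaced by a single mask, no character of the target word remains visible in the input, which is the leakage concern the abstract refers to.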
