首页> 外国专利> METHOD FOR COMPRESSION OF NEURAL NETWORK MODEL AND METHOD AND APPARATUS FOR LANGUAGE CORPORA TRANSLATION

METHOD FOR COMPRESSION OF NEURAL NETWORK MODEL AND METHOD AND APPARATUS FOR LANGUAGE CORPORA TRANSLATION

机译:压缩神经网络模型的方法和语言语言翻译方法和装置

摘要

FIELD: information storage.;SUBSTANCE: invention relates to a method, an apparatus, a computer-readable data storage medium for compression of a neural network model and a method for translation of a language corpora. The method includes obtaining a set of training samples including multiple pairs of training samples, wherein each pair of training samples includes source data and target data corresponding with the source data; training the initial teacher model using said initial data as input data and using said target data as control data; training one or more intermediate teacher models based on said set of training samples and the initial teacher model, wherein said one or more intermediate teacher models form a set of teacher models; training multiple candidate student models based on said set of training samples, initial teacher model and set of teacher models, wherein said set of candidate student models forms a set of student models; estimating the accuracy of the output results of the multiple candidate student models using a set of control data and selecting one of the multiple candidate student models as a target student model in accordance with the accuracy, wherein the number of model parameters of any of the intermediate teacher models is less than that of the initial teacher model and the number of model parameters of the candidate student models is less than that of any of the intermediate teacher models.;EFFECT: increased efficiency of compression of a neural network model.;18 cl, 9 dwg
机译:字段:信息存储。物质:发明涉及用于压缩神经网络模型的方法,装置,计算机可读数据存储介质以及语言语料库的翻译方法。该方法包括获得包括多对训练样本的一组训练样本,其中每对训练样本包括与源数据相对应的源数据和目标数据;使用所述初始数据作为输入数据训练初始教师模型,并使用所述目标数据作为控制数据;基于所述一组训练样本和初始教师模型训练一个或多个中间教师模型,其中所述一个或多个中间教师模型形成一组教师模型;培训基于所述培训样本,初始教师模型和教师模型集的多个候选学生模型,其中所述一组候选学生模型形成一组学生模型;使用一组控制数据估计多个候选学生模型的输出结果的准确性,并根据准确性选择多个候选学生模型作为目标学生模型之一,其中任何中间的模型参数的数量教师模型的初始教师模型的模型少于初始教师模型的模型参数数量小于任何中间教师模型的模型参数。;效果:增加神经网络模型的压缩效率。; 18 cl ,9 dwg.

著录项

  • 公开/公告号RU2749970C1

    专利类型

  • 公开/公告日2021-06-21

    原文格式PDF

  • 申请/专利权人

    申请/专利号RU20200102388

  • 申请日2019-11-26

  • 分类号G06N3/08;G06F40/58;

  • 国家 RU

  • 入库时间 2022-08-24 19:34:15

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号