首页> 外国专利> Methods and equipment for training deep learning models, electronic devices, computer-readable storage media and computer programs

Methods and equipment for training deep learning models, electronic devices, computer-readable storage media and computer programs

机译：培训深度学习模型的方法和设备、电子设备、计算机可读存储媒体和计算机程序

页面导航

摘要
著录项
相似文献

摘要

The embodiments of the present disclosure disclose methods and devices for training deep learning models. One specific embodiment of the method is to divide the model description information into at least two sections based on the step of acquiring the model description information and the configuration information of the deep learning model and the division point variable in the configuration information, and the model. The step of loading the descriptive information into the corresponding resource and executing it, and after inputting the training sample of a batch into the resource corresponding to the model descriptive information in the first section, the training is started, and the obtained context information is applied to the following. When the step of inputting the resource corresponding to the model description information of the section, the step of repeating as described above until the execution result of the resource corresponding to the model description information of the last section is obtained, and the training end condition are satisfied. Includes a step of outputting a trained deep learning model and, if the training end condition is not met, a step of taking a training sample of the next batch and executing the training step until the training end condition is met. The embodiment can realize a free combination of different types of devices, fully exert the computing power of different computing devices, and improve the training speed. [Selection diagram] Fig. 2

机译：本发明的实施例公开了用于训练深度学习模型的方法和设备。该方法的一个具体实施例是，基于获取深度学习模型的模型描述信息和配置信息以及配置信息中的分割点变量和模型的步骤，将模型描述信息划分为至少两个部分。将描述信息加载到相应的资源中并执行它的步骤，并且在将批次的训练样本输入到第一部分中的模型描述信息对应的资源中之后，开始训练，并且获得的上下文信息应用于以下内容。当输入与该部分的模型描述信息相对应的资源的步骤时，重复如上所述的步骤，直到获得与最后一部分的模型描述信息相对应的资源的执行结果，并且满足训练结束条件。包括输出经过训练的深度学习模型的步骤，如果不满足训练结束条件，则包括获取下一批训练样本并执行训练步骤直到满足训练结束条件的步骤。本实施例可以实现不同类型设备的自由组合，充分发挥不同计算设备的计算能力，提高训练速度。[选择图]图2

著录项

公开/公告号JP7029554B2

专利类型
公开/公告日2022-03-03

原文格式PDF
申请/专利权人ベイジンバイドゥネットコムサイエンステクノロジーカンパニーリミテッド;
展开▼

申请/专利号JP20200572679
发明设计人ホー、ティエンジェン;リウ、イー;トン、ターシアン;マー、イェンジュン;ユイ、ティエンハイ;
展开▼

申请日2020-06-09
分类号G06N20;G06F9/50;
国家 JP
入库时间 2022-08-24 23:56:22

相似文献

专利
外文文献
中文文献