首页> 外国专利> ISA-based compression in distributed training of neural networks

ISA-based compression in distributed training of neural networks

机译：基于ISA的神经网络分布式培训压缩

页面导航

摘要
著录项
相似文献

摘要

An overall gradient vector is computed at a server from a set of ISA vectors corresponding to a set of worker machines. An ISA vector of a worker machine including ISA instructions corresponding to a set of gradients, each gradient corresponding to a weight of a node of a neural network being distributedly trained in the worker machine. A set of register values is optimized for use in an approximation computation with an opcode to produce an x-th approximate gradient of an x-th gradient. A server ISA vector is constructed in which a server ISA instruction in an x-th position corresponds to the x-th gradient in the overall gradient vector. A processor at the worker machine is caused to update a set of weights of the neural network, using the set of optimized register values and the server ISA vector, thereby completing one iteration of training.

机译：从对应于一组工作机器的一组ISA向量计算的服务器上计算总梯度向量。一个ISA向量的工人机器，包括ISA指令，对应于一组渐变，每个梯度对应于神经网络的节点的权重被分布在工人机器中。一组寄存器值被优化以用于使用操作码的近似计算，以产生X-TH梯度的X-TH近似梯度。构建服务器ISA向量，其中X-TH位置中的服务器ISA指令对应于整个梯度向量中的X-TH梯度。工人机器的处理器是使用该组优化寄存器值和服务器ISA向量更新一组神经网络的重量，从而完成一次训练的一次迭代。

著录项

公开/公告号US10977552B2

专利类型
公开/公告日2021-04-13

原文格式PDF
申请/专利权人 INTERNATIONAL BUSINESS MACHINES CORPORATION;
展开▼

申请/专利号US201715709955
发明设计人 MINSIK CHO;ULRICH A. FINKLER;
展开▼

申请日2017-09-20
分类号G06N3/08;G06N3/04;G06F9/30;G06F9/46;
国家 US
入库时间 2022-08-24 18:10:56

相似文献

专利
外文文献
中文文献