DISTRIBUTED SYNCHRONOUS TRAINING ARCHITECTURE USING STALE WEIGHTS
Abstract
A computer-implemented method for distributed synchronous training of a neural network model includes performing, by a worker machine of a plurality of worker machines, a forward computation of a training data set using a plurality of N layers of the neural network model. The forward computation starts at Layer 1 and proceeds through Layer N of the neural network model. The method further includes performing, by the worker machine, a backward computation of the training data set, the backward computation starting at Layer N and proceeding through Layer 1 of the neural network model. The method further includes synchronizing, by the worker machine, a plurality of gradients outputted by the neural network model during the backward computation. The synchronizing of the plurality of gradients is performed with other worker machines of the plurality of worker machines and in parallel with the backward computation.
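The overlap described in the abstract can be illustrated with a toy simulation. The sketch below is not the patented implementation: worker machines are modeled as threads in one process, each layer's gradient is a single float, and the names `AllReduce`, `local_gradient`, `run_worker`, and `train_step` are hypothetical. It shows only the scheduling idea: as soon as the gradient for a layer is available during the backward pass (Layer N down to Layer 1), it is handed to a background communication thread that averages it with the other workers, while the backward computation moves on to the next layer. The stale-weights aspect named in the title is not modeled here.

```python
import threading
from concurrent.futures import ThreadPoolExecutor

NUM_WORKERS = 3  # hypothetical number of worker machines
NUM_LAYERS = 4   # N in the abstract


class AllReduce:
    """Toy in-process all-reduce: averages one scalar per layer across workers."""

    def __init__(self, num_workers, num_layers):
        self.num_workers = num_workers
        self.lock = threading.Lock()
        layers = range(1, num_layers + 1)
        self.sums = {layer: 0.0 for layer in layers}
        self.barriers = {layer: threading.Barrier(num_workers) for layer in layers}

    def average(self, layer, grad):
        with self.lock:
            self.sums[layer] += grad
        # Block until every worker has contributed its gradient for this layer.
        self.barriers[layer].wait()
        return self.sums[layer] / self.num_workers


def local_gradient(worker_id, layer):
    # Stand-in for a real backward step; the value is arbitrary but deterministic.
    return float(worker_id + layer)


def run_worker(worker_id, reducer, results):
    # One communication thread per worker, so synchronization of layer k
    # proceeds while the backward pass is already computing layer k - 1.
    with ThreadPoolExecutor(max_workers=1) as comm:
        pending = {}
        for layer in range(NUM_LAYERS, 0, -1):  # Layer N down to Layer 1
            grad = local_gradient(worker_id, layer)
            pending[layer] = comm.submit(reducer.average, layer, grad)
        # Wait for all synchronized gradients before the weight update.
        results[worker_id] = {layer: f.result() for layer, f in pending.items()}


def train_step():
    reducer = AllReduce(NUM_WORKERS, NUM_LAYERS)
    results = {}
    threads = [
        threading.Thread(target=run_worker, args=(w, reducer, results))
        for w in range(NUM_WORKERS)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    for worker_id, grads in sorted(train_step().items()):
        print(worker_id, grads)
```

Because every worker submits its layers in the same order (N first), the per-layer barriers are reached in the same sequence on each communication thread, which is what keeps this toy version free of deadlock. A real system would replace `AllReduce` with a collective from a communication library and would exchange tensors rather than floats.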