首页> 外文会议>2017 IEEE 24th International Conference on High Performance Computing >Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and Communication
【24h】

Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and Communication

机译:利用计算和通信的重叠进行并行深度卷积神经网络训练

获取原文
获取原文并翻译 | 示例

摘要

Training Convolutional Neural Network (CNN) is a computationally intensive task whose parallelization has become critical in order to complete the training in an acceptable time. However, there are two obstacles to developing a scalable parallel CNN in a distributed-memory computing environment. One is the high degree of data dependency exhibited in the model parameters across every two adjacent minibatches and the other is the large amount of data to be transferred across the communication channel. In this paper, we present a parallelization strategy that maximizes the overlap of inter-process communication with the computation. The overlapping is achieved by using a thread per compute node to initiate communication after the gradients are available. The output data of backpropagation stage is generated at each model layer, and the communication for the data can run concurrently with the computation of other layers. To study the effectiveness of the overlapping and its impact on the scalability, we evaluated various model architectures and hyperparameter settings. When training VGG-A model using ImageNet data sets, we achieve speedups of 62.97× and 77.97× on 128 compute nodes using mini-batch sizes of 256 and 512, respectively.
机译:训练卷积神经网络(CNN)是一项计算量很大的任务,其并行化对于在可接受的时间内完成训练至关重要。但是,在分布式内存计算环境中开发可伸缩并行CNN有两个障碍。一个是模型参数在每两个相邻小批处理之间表现出的高度数据依赖性,另一个是要在整个通信通道上传输的大量数据。在本文中,我们提出了一种并行化策略,该策略将进程间通信与计算的重叠最大化。可以通过在每个梯度可用之后使用每个计算节点的线程来发起通信来实现重叠。反向传播阶段的输出数据在每个模型层生成,并且数据通信可以与其他层的计算同时进行。为了研究重叠的有效性及其对可伸缩性的影响,我们评估了各种模型架构和超参数设置。当使用ImageNet数据集训练VGG-A模型时,我们分别使用256和512的小批量大小在128个计算节点上实现了62.97倍和77.97倍的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号