首页> 外文会议>IEEE International Conference on High Performance Computing >Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and Communication
【24h】

Parallel Deep Convolutional Neural Network Training by Exploiting the Overlapping of Computation and Communication

机译:利用计算与通信重叠并行深度卷积神经网络培训

获取原文

摘要

Training Convolutional Neural Network (CNN) is a computationally intensive task whose parallelization has become critical in order to complete the training in an acceptable time. However, there are two obstacles to developing a scalable parallel CNN in a distributed-memory computing environment. One is the high degree of data dependency exhibited in the model parameters across every two adjacent minibatches and the other is the large amount of data to be transferred across the communication channel. In this paper, we present a parallelization strategy that maximizes the overlap of inter-process communication with the computation. The overlapping is achieved by using a thread per compute node to initiate communication after the gradients are available. The output data of backpropagation stage is generated at each model layer, and the communication for the data can run concurrently with the computation of other layers. To study the effectiveness of the overlapping and its impact on the scalability, we evaluated various model architectures and hyperparameter settings. When training VGG-A model using ImageNet data sets, we achieve speedups of 62.97× and 77.97× on 128 compute nodes using mini-batch sizes of 256 and 512, respectively.
机译:培训卷积神经网络(CNN)是一种计算密集型任务,其并行化已经变得至关重要,以便在可接受的时间内完成培训。然而,在分布式记忆计算环境中开发可扩展并行CNN存在两个障碍。一个是在每两个相邻的小匹匹匹匹匹旁的模型参数中展出的高度数据依赖性,另一个是要在通信信道上传送的大量数据。在本文中,我们介绍了一种并行化策略,可以最大化与计算的过程间通信的重叠。通过使用每个计算节点的线程实现重叠以在梯度可用之后启动通信。 BackPropagation阶段的输出数据在每个模型层生成,并且数据的通信可以与其他层的计算同时运行。为研究重叠的有效性及其对可扩展性的影响,我们评估了各种模型架构和封立参数设置。使用ImageNet数据集培训VGG-A模型时,我们可以分别在128个计算节点上达到62.97×和77.97×的速度分别使用256和512的Mini-Batch Size。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号