We analyze the asymptotic performance of the training process of deep neural networks (NNs) on clusters in order to determine its scalability. For this purpose, i) we assume a data-parallel implementation of the training algorithm, which distributes the batches among the cluster nodes and replicates the model; ii) we leverage the roofline model to inspect performance at the node level, taking into account the floating-point unit throughput and memory bandwidth; and iii) we consider distinct collective communication schemes that are optimal depending on the message size and the underlying network interconnection topology. We then apply the resulting performance model to analyze the scalability of several well-known deep convolutional neural networks as a function of the batch size, node floating-point throughput, node memory bandwidth, cluster size, and link bandwidth.
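The node-level bound from the roofline model mentioned above can be sketched as follows. This is a minimal illustration, not the paper's implementation; the peak-throughput and bandwidth figures are hypothetical placeholders.

```python
def roofline_flops(peak_flops: float, mem_bw: float, intensity: float) -> float:
    """Attainable performance (FLOP/s) under the roofline model:
    capped either by peak floating-point throughput or by
    memory bandwidth times arithmetic intensity (FLOPs per byte)."""
    return min(peak_flops, mem_bw * intensity)

# Hypothetical node: 10 TFLOP/s peak, 900 GB/s memory bandwidth.
peak = 10e12
bw = 900e9

# Low arithmetic intensity -> memory-bound regime.
print(roofline_flops(peak, bw, 4.0))   # bandwidth-limited: 3.6e12 FLOP/s

# High arithmetic intensity -> compute-bound regime.
print(roofline_flops(peak, bw, 50.0))  # compute-limited: 1.0e13 FLOP/s
```

The kink between the two regimes occurs at the machine balance point, `peak_flops / mem_bw` FLOPs per byte, which determines whether a given layer's kernel is memory- or compute-bound on that node.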