IEEE Conference on Computer Vision and Pattern Recognition

FireCaffe: Near-Linear Acceleration of Deep Neural Network Training on Compute Clusters



Abstract

Long training times for high-accuracy deep neural networks (DNNs) impede research into new DNN architectures and slow the development of high-accuracy DNNs. In this paper we present FireCaffe, which successfully scales deep neural network training across a cluster of GPUs. We also present a number of best practices to aid in comparing advancements in methods for scaling and accelerating the training of deep neural networks. The speed and scalability of distributed algorithms are almost always limited by the overhead of communicating between servers; DNN training is not an exception to this rule. Therefore, the key consideration here is to reduce communication overhead wherever possible, while not degrading the accuracy of the DNN models that we train. Our approach has three key pillars. First, we select network hardware that achieves high bandwidth between GPU servers; Infiniband or Cray interconnects are ideal for this. Second, we consider a number of communication algorithms, and we find that reduction trees are more efficient and scalable than the traditional parameter server approach. Third, we optionally increase the batch size to reduce the total quantity of communication during DNN training, and we identify hyperparameters that allow us to reproduce the small-batch accuracy while training with large batch sizes. When training GoogLeNet and Network-in-Network on ImageNet with a cluster of 128 GPUs, we achieve 47x and 39x speedups, respectively.
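The second and third pillars lend themselves to a concrete illustration. Below is a minimal Python sketch, not the FireCaffe implementation itself; the worker count, gradient size, and function names are hypothetical, chosen only to make the communication pattern visible. It shows why summing gradients up a binary reduction tree takes on the order of log2(W) communication rounds for W workers, while a single parameter server must pull all W gradient messages through one link.

import numpy as np

def tree_allreduce(local_grads):
    # Sum gradients pairwise up a binary reduction tree.
    # With W workers the reduce phase takes ceil(log2(W)) rounds; a broadcast
    # back down the tree (not simulated here) costs roughly the same again.
    grads = [g.copy() for g in local_grads]
    w = len(grads)
    rounds = 0
    stride = 1
    while stride < w:
        # In each round, worker i absorbs the partial sum held by worker i + stride.
        for i in range(0, w, 2 * stride):
            if i + stride < w:
                grads[i] += grads[i + stride]
        stride *= 2
        rounds += 1
    total = grads[0]                        # worker 0 now holds the full sum
    return [total.copy() for _ in range(w)], rounds

if __name__ == "__main__":
    workers = 128                           # cluster size reported in the paper
    grad_size = 100_000                     # hypothetical, much smaller than a real DNN
    rng = np.random.default_rng(0)
    local = [rng.standard_normal(grad_size).astype(np.float32) for _ in range(workers)]

    summed, reduce_rounds = tree_allreduce(local)
    assert np.allclose(summed[0], np.sum(local, axis=0), atol=1e-2)

    print("reduction-tree rounds:", reduce_rounds)               # 7 for 128 workers
    print("serial transfers into one parameter server:", workers)

On the batch-size side, each data-parallel worker exchanges one full gradient per iteration, so communication per epoch scales roughly as (training examples / batch size) × gradient size; doubling the batch size therefore roughly halves the number of gradient exchanges per epoch, which is why FireCaffe optionally trains with larger batches once hyperparameters that preserve small-batch accuracy are identified.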
