International Conference on Neural Information Processing

Asynchronous, Data-Parallel Deep Convolutional Neural Network Training with Linear Prediction Model for Parameter Transition



Abstract

Recent studies have revealed that convolutional neural networks requiring a very large number of sum-of-product operations but relatively few parameters tend to exhibit strong model performance. Asynchronous stochastic gradient descent enables large-scale distributed computation for training such networks. However, asynchrony introduces stale gradients, which are known to slow down training. In this work, we propose a method that predicts future parameters during training to mitigate the drawback of staleness. We show that the proposed method achieves parameter-prediction accuracy good enough to improve the speed of asynchronous training. Experimental results on ImageNet demonstrate that, with 256 GPUs used in parallel, the proposed asynchronous training method reduces the training time needed to reach a given model accuracy by a factor of 1.9 compared to a synchronous training method.
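The core idea of the abstract, predicting the parameter values a stale gradient will be applied to, can be illustrated with a minimal linear-extrapolation sketch. This is an assumption-laden illustration, not the paper's exact prediction model: the function `predict_future_params` and the choice of a one-step difference as the linear trend are hypothetical.

```python
import numpy as np

def predict_future_params(w_curr, w_prev, staleness):
    """Linearly extrapolate parameters `staleness` update steps ahead.

    Assumes (hypothetically) that the most recent per-step parameter
    change continues unchanged, so a worker can compute its gradient
    against an estimate of the parameters that will actually be
    current when its update arrives at the parameter server.
    """
    step = w_curr - w_prev          # most recent parameter transition
    return w_curr + staleness * step  # linear prediction of the future

# Toy usage: a worker whose update will arrive 3 steps late.
w_prev = np.array([1.0, 2.0])
w_curr = np.array([1.1, 1.9])
w_pred = predict_future_params(w_curr, w_prev, staleness=3)
```

In an actual asynchronous setup, the gradient would then be evaluated at `w_pred` rather than at `w_curr`, reducing the mismatch caused by staleness.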
