Optimal memory-aware backpropagation of deep join networks

Abstract

The memory needs of deep learning training can prevent the user from considering large models and large batch sizes. In this work, we propose to use techniques from memory-aware scheduling and automatic differentiation (AD) to execute a backpropagation graph with a bounded memory requirement, at the cost of extra recomputations. The case of a single homogeneous chain, i.e. the case of a network whose stages are all identical and form a chain, is well understood, and optimal solutions have been proposed in the AD literature. The networks encountered in practice in the context of deep learning are much more diverse, both in terms of shape and heterogeneity. In this work, we define the class of backpropagation graphs, and extend the class of graphs on which one can compute, in polynomial time, a solution that minimizes the total number of recomputations. In particular, we consider join graphs, which correspond to models such as siamese or cross-modal networks. This article is part of a discussion meeting issue 'Numerical algorithms for high-performance computational science'.
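
The trade-off the abstract describes, bounding activation memory in exchange for extra forward recomputations, can be illustrated with PyTorch's built-in torch.utils.checkpoint. The sketch below is not the paper's polynomial-time optimal algorithm: it applies uniform per-stage checkpointing to a toy two-branch (siamese-style) join network, and the names Stage and JoinNet are illustrative assumptions, not from the article.

```python
# Minimal sketch: recomputation-based backpropagation with bounded
# activation memory, on a toy "join graph" (two shared-weight branches
# joined before a head, as in a siamese network).
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint

class Stage(nn.Module):
    """One homogeneous stage of a chain."""
    def __init__(self, dim):
        super().__init__()
        self.linear = nn.Linear(dim, dim)

    def forward(self, x):
        return torch.relu(self.linear(x))

class JoinNet(nn.Module):
    """Two inputs pass through the same chain of stages (shared
    weights), then their outputs are joined by a small head."""
    def __init__(self, dim, depth):
        super().__init__()
        self.branch = nn.ModuleList(Stage(dim) for _ in range(depth))
        self.head = nn.Linear(2 * dim, 1)

    def run_branch(self, x):
        for stage in self.branch:
            # checkpoint() discards this stage's intermediate
            # activations during the forward pass and recomputes them
            # during backpropagation: memory stays bounded per stage,
            # at the cost of one extra forward computation per stage.
            x = checkpoint(stage, x, use_reentrant=False)
        return x

    def forward(self, a, b):
        joined = torch.cat([self.run_branch(a), self.run_branch(b)], dim=-1)
        return self.head(joined)

model = JoinNet(dim=128, depth=16)
a, b = torch.randn(32, 128), torch.randn(32, 128)
loss = model(a, b).sum()
loss.backward()  # triggers the recomputations stage by stage
```

Checkpointing every stage, as above, is one fixed point on the memory/recomputation curve; the article's contribution is choosing which vertices of the backpropagation graph to store so that the total number of recomputations is minimized under a given memory bound.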