
An Adaptive Gradient Method with Differentiation Element in Deep Neural Networks


Abstract

Current adaptive gradient algorithms (such as Adam) used in deep neural networks have the advantages of fast training, simple hyperparameter tuning, and high computational efficiency. However, these methods typically base the update on the root mean square of past gradients, which often causes the learning rate to oscillate. As a result, the model may overshoot severely and even fail to converge. The PID optimization algorithm for deep neural networks offers a new way to address this problem: it borrows the idea of automatic control to reduce overshoot in stochastic gradient methods. The Adam algorithm behaves like an adaptive PI controller. Inspired by this, a differentiation (D) element is introduced into the Adam algorithm to accelerate model convergence. The algorithm was evaluated on the MNIST, Cifar-10, Cifar-100, and Tiny-ImageNet data sets in the experiment section. The results show that training speed improves by 10% while the accuracy of the model is maintained.
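To make the idea concrete, below is a minimal NumPy sketch of an Adam-style update augmented with a differentiation (D) term computed as the finite difference of successive gradients, in the spirit of a PID controller. The coefficient kd, the exact placement of the D correction, and the names adam_with_d and grads_fn are illustrative assumptions, not the paper's published formulation.

```python
import numpy as np

def adam_with_d(params, grads_fn, lr=1e-3, betas=(0.9, 0.999),
                kd=0.1, eps=1e-8, steps=1000):
    """Sketch of an Adam update with an added differentiation (D) term.

    The D term reacts to the change in gradient between steps and is
    intended to damp the oscillation caused by the adaptive learning rate.
    """
    beta1, beta2 = betas
    m = np.zeros_like(params)        # first moment (integral-like term)
    v = np.zeros_like(params)        # second moment (adaptive scaling)
    prev_grad = np.zeros_like(params)
    for t in range(1, steps + 1):
        g = grads_fn(params)         # current gradient (proportional term)
        d = g - prev_grad            # finite-difference "derivative" of gradient
        prev_grad = g
        m = beta1 * m + (1 - beta1) * g
        v = beta2 * v + (1 - beta2) * g * g
        m_hat = m / (1 - beta1 ** t)           # bias-corrected first moment
        v_hat = v / (1 - beta2 ** t)           # bias-corrected second moment
        # Standard Adam step plus a D correction scaled by kd (assumed form).
        params -= lr * (m_hat + kd * d) / (np.sqrt(v_hat) + eps)
    return params
```

In PID terms, the current gradient plays the proportional role, the exponential moving average m accumulates history like an integral term, and the added difference d supplies the derivative action that anticipates and damps overshoot.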
