Stochastic gradient descent (SGD) is widely regarded as a successful optimization algorithm in machine learning. In this paper, we propose a novel annealed gradient descent (AGD) method for non-convex optimization in deep learning. During optimization, AGD follows an annealing schedule to minimize a sequence of progressively refined, smoother mosaic functions that approximate the original non-convex objective function. We present a theoretical analysis of its convergence properties and learning speed. The proposed AGD algorithm is applied to learning deep neural networks (DNNs) for image recognition on MNIST and speech recognition on Switchboard. Experimental results show that AGD yields performance comparable to SGD while significantly expediting DNN training on big data sets (by about 40%).
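The abstract does not specify how the mosaic functions are built, so the sketch below does not reproduce the paper's construction. As a loose illustration of the coarse-to-fine annealing idea only, it substitutes a Gaussian-smoothed surrogate of the objective whose smoothing width is annealed toward zero over the schedule; all function names and hyperparameter values are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)

def objective(x):
    # A toy non-convex function with many local minima.
    return np.sin(5.0 * x) + 0.5 * x ** 2

def smoothed_grad(x, sigma, n_samples=64):
    # Monte Carlo gradient of the Gaussian-smoothed surrogate
    # F_sigma(x) = E_eps[f(x + sigma * eps)], eps ~ N(0, 1).
    # By the score-function identity, grad F_sigma(x)
    # = E_eps[f(x + sigma * eps) * eps] / sigma.
    eps = rng.standard_normal(n_samples)
    return np.mean(objective(x + sigma * eps) * eps) / sigma

def annealed_gd(x0, sigmas=(2.0, 1.0, 0.5, 0.1), lr=0.05, steps_per_stage=200):
    # Descend on a sequence of progressively sharper surrogates
    # following the annealing schedule `sigmas`: a large sigma
    # washes out local minima; a small sigma approaches the
    # original objective.
    x = x0
    for sigma in sigmas:
        for _ in range(steps_per_stage):
            x -= lr * smoothed_grad(x, sigma)
    return x

x_star = annealed_gd(x0=3.0)
print(f"x* = {x_star:.3f}, f(x*) = {objective(x_star):.3f}")
```

The design choice mirrors the annealing principle stated in the abstract: early stages optimize a heavily smoothed approximation that is easy to descend, and later stages tighten the approximation so the iterate is refined near a good basin of the true objective.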