International Conference on Machine Learning, Optimization, and Data Science
Ensemble Kalman Filter Optimizing Deep Neural Networks: An Alternative Approach to Non-performing Gradient Descent


Abstract

The successful training of deep neural networks depends on the initialization scheme and the choice of activation function. Non-optimally chosen parameter settings lead to the well-known problem of exploding or vanishing gradients, which arises when gradient descent with backpropagation is applied. In this setting the Ensemble Kalman Filter (EnKF) can serve as an alternative optimizer for training neural networks. The EnKF requires no explicit calculation of gradients or adjoints, and we show that this resolves the exploding and vanishing gradient problem. We analyze different parameter initializations, propose a dynamic change in ensembles, and compare results to established methods.
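The gradient-free update the abstract alludes to can be sketched as a minimal ensemble Kalman inversion loop: each ensemble member is a flattened parameter vector, and the Kalman gain is estimated from ensemble covariances of parameters and predictions, so no backpropagation is needed. This is a toy illustration under stated assumptions, not the paper's implementation; the network size, ensemble size `J`, and noise level `gamma` are illustrative choices.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy regression data: fit y = sin(x) on [-2, 2].
X = np.linspace(-2.0, 2.0, 32).reshape(-1, 1)
y = np.sin(X).ravel()

# Tiny one-hidden-layer tanh network; all sizes are illustrative assumptions.
H = 8
n_params = 1 * H + H + H * 1 + 1  # W1, b1, W2, b2 flattened

def forward(theta, X):
    """Evaluate the network for one flattened parameter vector."""
    i = 0
    W1 = theta[i:i + H].reshape(1, H); i += H
    b1 = theta[i:i + H];               i += H
    W2 = theta[i:i + H].reshape(H, 1); i += H
    b2 = theta[i]
    h = np.tanh(X @ W1 + b1)
    return (h @ W2).ravel() + b2

# Ensemble of J parameter vectors drawn from the prior (initialization).
J = 100
thetas = rng.normal(0.0, 1.0, size=(J, n_params))

gamma = 1e-2  # assumed observation-noise level; regularizes the gain

for step in range(200):
    G = np.stack([forward(t, X) for t in thetas])   # (J, N) predictions
    dtheta = thetas - thetas.mean(axis=0)           # parameter anomalies
    dG = G - G.mean(axis=0)                         # prediction anomalies
    C_tG = dtheta.T @ dG / J                        # cross-covariance (P, N)
    C_GG = dG.T @ dG / J                            # prediction covariance (N, N)
    # Kalman gain from ensemble statistics only -- no gradients or adjoints.
    K = C_tG @ np.linalg.inv(C_GG + gamma * np.eye(len(y)))
    thetas = thetas + (y - G) @ K.T                 # EnKF update of all members

mse = np.mean((forward(thetas.mean(axis=0), X) - y) ** 2)
```

Because the update uses only forward evaluations and empirical covariances, saturated activations or poorly scaled initializations cannot produce exploding or vanishing gradients; the ensemble spread, not a backpropagated signal, drives the parameter change.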
