2018 52nd Asilomar Conference on Signals, Systems, and Computers

Why RELU Units Sometimes Die: Analysis of Single-Unit Error Backpropagation in Neural Networks


Abstract

Recently, neural networks in machine learning use rectified linear units (ReLUs) in early processing layers for better performance. Training these structures sometimes results in “dying ReLU units” with near-zero outputs. We first explore this condition via simulation using the CIFAR-10 dataset and variants of two popular convolutive neural network architectures. Our explorations show that the output activation probability Pr[y > 0] is generally less than 0.5 at system convergence for layers that do not employ skip connections, and this activation probability tends to decrease as one progresses from input layer to output layer. Employing a simplified model of a single ReLU unit trained by a variant of error backpropagation, we then perform a statistical convergence analysis to explore the model's evolutionary behavior. Our analysis describes the potentially-slower convergence speeds of dying ReLU units, and this issue can occur regardless of how the weights are initialized.
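The dying-ReLU condition described in the abstract can be reproduced with a minimal single-unit sketch (this is not the authors' model or code; the synthetic data, squared-error loss, and learning rate are assumptions for illustration). A lone ReLU unit trained by gradient descent against mostly-negative targets is driven toward negative pre-activations, its activation probability Pr[y > 0] falls toward zero, and once no input activates it the backpropagated gradient is identically zero, so the weights cannot recover.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy regression task (illustrative assumption, not from the paper):
# Gaussian inputs and mostly-negative targets that a non-negative ReLU
# output cannot match, so training pushes the pre-activation downward.
d = 5
n = 1000
X = rng.standard_normal((n, d))
t = -np.abs(rng.standard_normal(n))

# Single ReLU unit y = max(0, w.x + b), trained by plain gradient descent
# on mean squared error, standing in for the error-backpropagation variant
# analyzed in the paper.
w = 0.1 * rng.standard_normal(d)
b = 0.0
lr = 0.05

for epoch in range(200):
    z = X @ w + b                    # pre-activation
    y = np.maximum(z, 0.0)           # ReLU output
    grad_y = (y - t) / n             # d(loss)/dy for (1/2n) * sum (y - t)^2
    grad_z = grad_y * (z > 0)        # ReLU gradient is zero where z <= 0
    w -= lr * (X.T @ grad_z)
    b -= lr * grad_z.sum()

# Activation probability Pr[y > 0] on the training inputs; once it reaches
# zero every gradient through the ReLU vanishes and the unit is "dead".
print("Pr[y > 0] =", np.mean((X @ w + b) > 0.0))
```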
