Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation

机译：重新审视用回到传播训练的多层训练的重量初始化问题

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

One of the main reasons for the slow convergence and the suboptimal generalization results of MLP (Multilayer Perceptrons) based on gradient descent training is the lack of a proper initialization of the weights to be adjusted. Even sophisticated learning procedures are not able to compensate for bad initial values of weights, while good initial guess leads to fast convergence and or better generalization capability even with simple gradient-based error minimization techniques. Although initial weight space in MLPs seems so critical there is no study so far of its properties with regards to which regions lead to solutions or failures concerning generalization and convergence in real world problems. There exist only some preliminary studies for toy problems, like XOR. A data mining approach, based on Self Organizing Feature Maps (SOM), is involved in this paper to demonstrate that a complete analysis of the MLP weight space is possible. This is the main novelty of this paper. The conclusions drawn from this novel application of SOM algorithm in MLP analysis extend significantly previous preliminary results in the literature. MLP initialization procedures are overviewed along with all conclusions so far drawn in the literature and an extensive experimental study on more representative tasks, using our data mining approach, reveals important initial weight space properties of MLPs, extending previous knowledge and literature results.

机译：基于梯度下降训练的MLP（MultiDayer Perceptrons）的慢收敛和次优化结果的主要原因之一是缺乏要调整的权重的正确初始化。即使是复杂的学习程序也无法补偿重量的差值不良，而良好的初步猜测也会导致快速收敛和更好的泛化能力，即使具有简单的基于梯度的误差最小化技术。虽然MLP中的初始重量空间似乎如此至关重要，但到目前为止还没有研究其在哪些地区导致关于现实世界问题的泛化和收敛的解决方案或失败。只存在一些对玩具问题的初步研究，如XOR。基于自组织特征映射（SOM）的数据挖掘方法涉及本文，以证明对MLP重量空间的完全分析是可能的。这是本文的主要新颖性。从MLP分析中的SOM算法的这种新应用得出的结论在文献中显着延伸了先前的初步结果。概述了MLP初始化程序以及在文献中绘制的所有结论以及使用我们的数据挖掘方法的更多代表性任务的广泛实验研究，揭示了MLP的重要初始重量空间特性，延长了先前的知识和文献结果。

著录项

来源
《International Conference on Neural Information Processing》|2009年||共8页
会议地点
作者
Stavros Adam; Dimitrios Alexios Karras; Michael N. Vrahatis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP3-53;
关键词

相似文献

外文文献
中文文献
专利

1. Invariant Set of Weight of Perceptron Trained by Perceptron Training Algorithm [J] . Ho C. Y.-F., Ling B. W.-K., Iu H. H.-C. Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on . 2010,第6期

机译：感知器训练算法训练的感知器权重不变集
2. Hearing loss detection by discrete wavelet transform and multi-layer perceptron trained by nature-inspired algorithms [J] . Jingyuan Yang, Vishnu Varthanan Govindaraj, Ming Yang, Multimedia Tools and Applications . 2020,第21a22期

机译：通过自然启发算法训练的离散小波变换和多层的验证损失检测
3. Passive sonar target classification using multi-layer perceptron trained by salp swarm algorithm [J] . Khishe Mohammad, Mohammadi Hassan Ocean Engineering . 2019,第JUNa1期

机译：Salp群算法训练的多层感知器被动声纳目标分类
4. Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation [C] . Stavros Adam, Dimitrios Alexios Karras, Michael N. Vrahatis ICONIP 2008;International conference on advances in neuro-information processing . 2009

机译：再谈反向传播训练的多层感知器的权重初始化问题
5. Classification of Respiratory Physiology with Airwave Oscillometry Using Multi-layer Perceptron [D] . Xu, Tong . 2020

机译：利用多层扫描法与空洞示波性分类
6. Predicting Risk of Antenatal Depression and Anxiety Using Multi-Layer Perceptrons and Support Vector Machines [O] . Fajar Javed, Syed Omer Gilani, Seemab Latif, 2021

机译：使用多层感知和支持向量机预测产前抑郁和焦虑的风险
7. Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation [O] . Stavros Adam, Dimitrios Alexios Karras, Michael N. Vrahatis 2013

机译：再谈反向传播训练的多层感知器的权重初始化问题

Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation

摘要

著录项

相似文献

相关主题

期刊订阅