Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation

机译：再谈反向传播训练的多层感知器的权重初始化问题

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

One of the main reasons for the slow convergence and the suboptimal generalization results of MLP (Multilayer Perceptrons) based on gradient descent training is the lack of a proper initialization of the weights to be adjusted. Even sophisticated learning procedures are not able to compensate for bad initial values of weights, while good initial guess leads to fast convergence and or better generalization capability even with simple gradient-based error minimization techniques. Although initial weight space in MLPs seems so critical there is no study so far of its properties with regards to which regions lead to solutions or failures concerning generalization and convergence in real world problems. There exist only some preliminary studies for toy problems, like XOR. A data mining approach, based on Self Organizing Feature Maps (SOM), is involved in this paper to demonstrate that a complete analysis of the MLP weight space is possible. This is the main novelty of this paper. The conclusions drawn from this novel application of SOM algorithm in MLP analysis extend significantly previous preliminary results in the literature. MLP initialization procedures are overviewed along with all conclusions so far drawn in the literature and an extensive experimental study on more representative tasks, using our data mining approach, reveals important initial weight space properties of MLPs, extending previous knowledge and literature results.

机译：基于梯度下降训练的MLP（多层感知器）收敛缓慢且泛化结果欠佳的主要原因之一是缺乏适当的初始化权值来进行调整。即使是复杂的学习程序也无法补偿错误的初始权值，而良好的初始猜测甚至可以使用简单的基于梯度的误差最小化技术来实现快速收敛和/或更好的泛化能力。尽管MLP中的初始权重空间似乎非常关键，但迄今为止，尚未针对其区域导致有关现实世界中的泛化和收敛的解决方案或失败问题进行研究。对于诸如XOR之类的玩具问题，仅存在一些初步研究。本文涉及一种基于自组织特征图（SOM）的数据挖掘方法，以证明对MLP权空间进行完整分析是可能的。这是本文的主要新颖之处。从SOM算法在MLP分析中的这种新颖应用得出的结论大大扩展了文献中先前的初步结果。概述了MLP初始化程序以及迄今为止在文献中得出的所有结论，并使用我们的数据挖掘方法对更具代表性的任务进行了广泛的实验研究，揭示了MLP的重要初始权空间特性，扩展了先前的知识和文献报道的范围。

著录项

来源
《ICONIP 2008;International conference on advances in neuro-information processing》|2009年|308-315|共8页
会议地点
作者
Stavros Adam; Dimitrios Alexios Karras; Michael N. Vrahatis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类信息处理（信息加工）;
关键词

相似文献

外文文献
中文文献
专利

1. Invariant Set of Weight of Perceptron Trained by Perceptron Training Algorithm [J] . Ho C. Y.-F., Ling B. W.-K., Iu H. H.-C. Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on . 2010,第6期

机译：感知器训练算法训练的感知器权重不变集
2. Hearing loss detection by discrete wavelet transform and multi-layer perceptron trained by nature-inspired algorithms [J] . Jingyuan Yang, Vishnu Varthanan Govindaraj, Ming Yang, Multimedia Tools and Applications . 2020,第21a22期

机译：通过自然启发算法训练的离散小波变换和多层的验证损失检测
3. Passive sonar target classification using multi-layer perceptron trained by salp swarm algorithm [J] . Khishe Mohammad, Mohammadi Hassan Ocean Engineering . 2019,第JUNa1期

机译：Salp群算法训练的多层感知器被动声纳目标分类
4. Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation [C] . Stavros Adam, Dimitrios Alexios Karras, Michael N. Vrahatis International Conference on Neural Information Processing . 2009

机译：重新审视用回到传播训练的多层训练的重量初始化问题
5. Classification of Respiratory Physiology with Airwave Oscillometry Using Multi-layer Perceptron [D] . Xu, Tong . 2020

机译：利用多层扫描法与空洞示波性分类
6. Predicting Risk of Antenatal Depression and Anxiety Using Multi-Layer Perceptrons and Support Vector Machines [O] . Fajar Javed, Syed Omer Gilani, Seemab Latif, 2021

机译：使用多层感知和支持向量机预测产前抑郁和焦虑的风险
7. Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation [O] . Stavros Adam, Dimitrios Alexios Karras, Michael N. Vrahatis 2013

机译：再谈反向传播训练的多层感知器的权重初始化问题

Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation

摘要

著录项

相似文献

相关主题

期刊订阅