首页> 外文会议>ICONIP 2008;International conference on advances in neuro-information processing >Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation
【24h】

Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation

机译:再谈反向传播训练的多层感知器的权重初始化问题

获取原文

摘要

One of the main reasons for the slow convergence and the suboptimal generalization results of MLP (Multilayer Perceptrons) based on gradient descent training is the lack of a proper initialization of the weights to be adjusted. Even sophisticated learning procedures are not able to compensate for bad initial values of weights, while good initial guess leads to fast convergence and or better generalization capability even with simple gradient-based error minimization techniques. Although initial weight space in MLPs seems so critical there is no study so far of its properties with regards to which regions lead to solutions or failures concerning generalization and convergence in real world problems. There exist only some preliminary studies for toy problems, like XOR. A data mining approach, based on Self Organizing Feature Maps (SOM), is involved in this paper to demonstrate that a complete analysis of the MLP weight space is possible. This is the main novelty of this paper. The conclusions drawn from this novel application of SOM algorithm in MLP analysis extend significantly previous preliminary results in the literature. MLP initialization procedures are overviewed along with all conclusions so far drawn in the literature and an extensive experimental study on more representative tasks, using our data mining approach, reveals important initial weight space properties of MLPs, extending previous knowledge and literature results.
机译:基于梯度下降训练的MLP(多层感知器)收敛缓慢且泛化结果欠佳的主要原因之一是缺乏适当的初始化权值来进行调整。即使是复杂的学习程序也无法补偿错误的初始权值,而良好的初始猜测甚至可以使用简单的基于梯度的误差最小化技术来实现快速收敛和/或更好的泛化能力。尽管MLP中的初始权重空间似乎非常关键,但迄今为止,尚未针对其区域导致有关现实世界中的泛化和收敛的解决方案或失败问题进行研究。对于诸如XOR之类的玩具问题,仅存在一些初步研究。本文涉及一种基于自组织特征图(SOM)的数据挖掘方法,以证明对MLP权空间进行完整分析是可能的。这是本文的主要新颖之处。从SOM算法在MLP分析中的这种新颖应用得出的结论大大扩展了文献中先前的初步结果。概述了MLP初始化程序以及迄今为止在文献中得出的所有结论,并使用我们的数据挖掘方法对更具代表性的任务进行了广泛的实验研究,揭示了MLP的重要初始权空间特性,扩展了先前的知识和文献报道的范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号