首页> 外文会议>International Conference on Neural Information Processing >Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation
【24h】

Revisiting the Problem of Weight Initialization for Multi-Layer Perceptrons Trained with Back Propagation

机译:重新审视用回到传播训练的多层训练的重量初始化问题

获取原文

摘要

One of the main reasons for the slow convergence and the suboptimal generalization results of MLP (Multilayer Perceptrons) based on gradient descent training is the lack of a proper initialization of the weights to be adjusted. Even sophisticated learning procedures are not able to compensate for bad initial values of weights, while good initial guess leads to fast convergence and or better generalization capability even with simple gradient-based error minimization techniques. Although initial weight space in MLPs seems so critical there is no study so far of its properties with regards to which regions lead to solutions or failures concerning generalization and convergence in real world problems. There exist only some preliminary studies for toy problems, like XOR. A data mining approach, based on Self Organizing Feature Maps (SOM), is involved in this paper to demonstrate that a complete analysis of the MLP weight space is possible. This is the main novelty of this paper. The conclusions drawn from this novel application of SOM algorithm in MLP analysis extend significantly previous preliminary results in the literature. MLP initialization procedures are overviewed along with all conclusions so far drawn in the literature and an extensive experimental study on more representative tasks, using our data mining approach, reveals important initial weight space properties of MLPs, extending previous knowledge and literature results.
机译:基于梯度下降训练的MLP(MultiDayer Perceptrons)的慢收敛和次优化结果的主要原因之一是缺乏要调整的权重的正确初始化。即使是复杂的学习程序也无法补偿重量的差值不良,而良好的初步猜测也会导致快速收敛和更好的泛化能力,即使具有简单的基于梯度的误差最小化技术。虽然MLP中的初始重量空间似乎如此至关重要,但到目前为止还没有研究其在哪些地区导致关于现实世界问题的泛化和收敛的解决方案或失败。只存在一些对玩具问题的初步研究,如XOR。基于自组织特征映射(SOM)的数据挖掘方法涉及本文,以证明对MLP重量空间的完全分析是可能的。这是本文的主要新颖性。从MLP分析中的SOM算法的这种新应用得出的结论在文献中显着延伸了先前的初步结果。概述了MLP初始化程序以及在文献中绘制的所有结论以及使用我们的数据挖掘方法的更多代表性任务的广泛实验研究,揭示了MLP的重要初始重量空间特性,延长了先前的知识和文献结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号