首页> 外文期刊>Global Journal of Technology and Optimization >Direct Differentiation Based Hessian Formulation for Training Multilayer Feed forward Neural Networks using the LM Algorithm-Performance Comparison with Conventional Jacobian-Based Learning
【24h】

Direct Differentiation Based Hessian Formulation for Training Multilayer Feed forward Neural Networks using the LM Algorithm-Performance Comparison with Conventional Jacobian-Based Learning

机译:LM算法的性能比较与基于传统Jacobian的学习方法的基于直接微分的Hessian公式训练多层前馈神经网络

获取原文
       

摘要

The Levenberg-Marquardt (LM) algorithm is the most commonly used training algorithm for moderate-sized feed forward artificial neural networks (ANNs) due to its high convergence rate and reasonably good accuracy. It conventionally employs a Jacobian-based approximation to the Hessian matrix, since exact evaluation of the Hessian matrix is generally considered computationally prohibitive. However, the storage of Jacobian matrix in computer memory is itself prone towards memory constraints, especially if the number of patterns in the training data exceeds a critical threshold. This paper presents a first attempt of evaluating the exact Hessian matrix using the direct differentiation approach for training a multilayer feed forward neural network using the LM algorithm. The weights employed for network training are initialized using a random number generator in MATLAB (R2010a). The efficiency of the proposed algorithm has been demonstrated using the well-known 2-spiral and the parity-N datasets, and the training performance has been compared with the Neural Network Toolbox in MATLAB (R2010a) which employs the conventional Jacobian-based learning methodology.
机译:Levenberg-Marquardt(LM)算法是中型前馈人工神经网络(ANN)的最常用训练算法,因为它具有较高的收敛速度和相当好的准确性。通常,由于对Hessian矩阵的精确评估通常被认为在计算上是禁止的,因此它通常对Hessian矩阵采用基于Jacobian的近似。但是,雅可比矩阵在计算机内存中的存储本身容易受到内存限制,尤其是在训练数据中的模式数量超过临界阈值的情况下。本文提出了使用直接微分方法评估精确的Hessian矩阵的首次尝试,以使用LM算法训练多层前馈神经网络。使用MATLAB(R2010a)中的随机数生成器初始化用于网络训练的权重。使用众所周知的2-spiral和parity-N数据集证明了该算法的效率,并将训练性能与MATLAB(R2010a)中的神经网络工具箱进行了比较,后者使用了传统的基于Jacobian的学习方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号