【24h】

ADVANCED NEURAL NETWORK LEARNING APPLIED TO MODELING OF PULPING OF SUGAR MAPLE

机译:先进的神经网络学习应用于糖浆制浆建模

获取原文
获取原文并翻译 | 示例

摘要

This paper reports work done to improve the modeling of complex processes when only small experimental datum sets are available. Neural networks are used to capture the nonlinear underlying phenomena contained in the data set and to partly eliminate the burden of having to specify completely the structure of the model. Two different types of neural networks were used for the application of Pulping of Sugar Maple problem. A three layer feed forward neural networks, using the Preconditioned Conjugate Gradient (PCG) methods were used in this investigation. Preconditioning is a method to improve convergence by lowering the condition number and increasing the eigenvalues clustering. The idea is to solve the modified problem M~(-1) Ax = M~(-1) b where Mis a positive-definite preconditioner that is closely related to A. We mainly focused on Preconditioned Conjugate Gradient- based training methods which originated from optimization theory, namely Preconditioned Conjugate Gradient with Fletcher-Reeves Update (PCGF), Preconditioned Conjugate Gradient with Polak-Ribiere Update (PCGP) and Preconditioned Conjugate Gradient with Powell-Beale Restarts (PCGB). The computational experiments revealed that the PCG methods, through its preconditioner matrix, reduced drastically the mean squared error during training. The CG and PCG methods have a much faster convergence rate than the BP; since it uses second order information to calculate the new direction (Rao, 1978). All the trials for the PCG methods converged to the required solution indicating that the choice of initial weights is appropriate. Even though it does not guarantee convergence to the required mean squared error, it reduces the possibility of getting stuck in local minima. The PCG algorithms have proved its capability in the generalization aspect, at the same time preserving the efficient convergence of the algorithm. The variation in the network outputs is explained very well by the corresponding targets. The results of the simulations suggest that, using preconditioning techniques, the condition numbers of a matrix will be improved. If the eigenvalues of the input matrix have been clustered, we can iteratively solve the problem more quickly. The behavior of the PCG methods in the simulations proved to be robust against phenomenon such as oscillations due to large step size.
机译:当只有较小的实验数据集可用时,本文报告为改善复杂过程的建模所做的工作。神经网络用于捕获数据集中包含的非线性潜在现象,并部分消除了必须完全指定模型结构的负担。两种不同类型的神经网络被用于糖枫问题的制浆。在这项研究中使用了三层前馈神经网络,使用了预处理共轭梯度(PCG)方法。预处理是一种通过降低条件数量和增加特征值聚类来提高收敛性的方法。这个想法是要解决修正的问题M〜(-1)Ax = M〜(-1)b,其中Mis是与A密切相关的正定前置条件。我们主要研究基于预置共轭梯度的训练方法从优化理论中得出,即带Fletcher-Reeves更新的预条件共轭梯度(PCGF),带Polak-Ribiere更新的预条件共轭梯度(PCGP)和带Powell-Beale重新启动的预条件共轭梯度(PCGB)。计算实验表明,PCG方法通过其预处理器矩阵可以大大减少训练过程中的均方误差。 CG和PCG方法的收敛速度比BP快得多。因为它使用二阶信息来计算新方向(Rao,1978)。 PCG方法的所有试验都收敛到所需的解决方案,表明初始权重的选择是适当的。即使不能保证收敛到所需的均方误差,也可以减少陷入局部最小值的可能性。 PCG算法已在泛化方面证明了其功能,同时保留了算法的有效收敛性。相应的目标很好地说明了网络输出的变化。仿真结果表明,使用预处理技术,可以改善矩阵的条件数。如果输入矩阵的特征值已聚类,则可以更快地迭代解决问题。在仿真中,PCG方法的行为被证明对步长较大导致的诸如振荡之类的现象具有鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号