首页> 外文会议>International Conference on Numerical Methods and Applications >Hebbian Versus Gradient Training of ESN Actors in Closed-Loop ACD
【24h】

Hebbian Versus Gradient Training of ESN Actors in Closed-Loop ACD

机译:Hebbian与闭环ACD中ESN演员的渐变训练

获取原文

摘要

The present work continues investigations on combination between Adaptive Critic Design (ACD) approach - a gradient-based optimization technique - and a more biologically plausible associative or Hebbian learning. Echo state network (ESN) was used as adaptive critic element that was trained minimizing temporal difference error. While in the previous work the actor was a time profile of the action variable, here investigations are extended to the closed loop (feedback) control scheme. The actor is another ESN network and its inputs are some of the process state variables while its output is the value of the controlled variable. The only trainable connections of the actor - from its reservoir to the readout - are trained to minimize (maximize) the critic output. Comparison between backpropagation of utility approach that is gradient descent algorithm and a Hebbian learning law is made. These two approaches are tested on a task for optimization of a complex nonlinear process for bio-polymer production. The obtained results are compared with respect to the convergence speed as well as to the obtained solution, i.e. reached local optima.
机译:目前的工作继续调查适应性批评设计(ACD)方法 - 一种基于梯度的优化技术 - 以及更具生物合理的联想或Hebbian学习。 Echo State Network(ESN)被用作自适应批评元素,训练最小化时间差错误差。虽然在上一个工作中,演员是动作变量的时间轮廓,但在这里调查扩展到闭环(反馈)控制方案。演员是另一个ESN网络,其输入是一些过程状态变量,而其输出是受控变量的值。 actor - 从它的水库到读数的唯一可训练连接 - 训练,以最小化(最大化)批评批评输出。制作了梯度下降算法的实用方法的BackProjagation与Hebbian学习法的比较。在任务上测试这两种方法以优化生物聚合物生产的复杂非线性方法。将得到的结果与收敛速度以及所得溶液相比,即达到局部最佳溶液。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号