Hebbian Versus Gradient Training of ESN Actors in Closed-Loop ACD

机译：Hebbian与闭环ACD中ESN演员的渐变训练

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

The present work continues investigations on combination between Adaptive Critic Design (ACD) approach - a gradient-based optimization technique - and a more biologically plausible associative or Hebbian learning. Echo state network (ESN) was used as adaptive critic element that was trained minimizing temporal difference error. While in the previous work the actor was a time profile of the action variable, here investigations are extended to the closed loop (feedback) control scheme. The actor is another ESN network and its inputs are some of the process state variables while its output is the value of the controlled variable. The only trainable connections of the actor - from its reservoir to the readout - are trained to minimize (maximize) the critic output. Comparison between backpropagation of utility approach that is gradient descent algorithm and a Hebbian learning law is made. These two approaches are tested on a task for optimization of a complex nonlinear process for bio-polymer production. The obtained results are compared with respect to the convergence speed as well as to the obtained solution, i.e. reached local optima.

机译：目前的工作继续调查适应性批评设计（ACD）方法 - 一种基于梯度的优化技术 - 以及更具生物合理的联想或Hebbian学习。 Echo State Network（ESN）被用作自适应批评元素，训练最小化时间差错误差。虽然在上一个工作中，演员是动作变量的时间轮廓，但在这里调查扩展到闭环（反馈）控制方案。演员是另一个ESN网络，其输入是一些过程状态变量，而其输出是受控变量的值。 actor - 从它的水库到读数的唯一可训练连接 - 训练，以最小化（最大化）批评批评输出。制作了梯度下降算法的实用方法的BackProjagation与Hebbian学习法的比较。在任务上测试这两种方法以优化生物聚合物生产的复杂非线性方法。将得到的结果与收敛速度以及所得溶液相比，即达到局部最佳溶液。

著录项

来源
《International Conference on Numerical Methods and Applications》|2015年||共8页
会议地点
作者
Petia Koprinkova-Hristova;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301.6-53;
关键词
Reinforcement Learning; Hebbian learning; Adaptive Critic Design; Echo State Networks;

机译：加强学习;Hebbian学习;自适应批评设计;回声状态网络;

相似文献

外文文献
中文文献
专利

1. Sagittal Reconstruction and Clinical Outcome Using Traditional ACDF, Versus Stand-alone ACDF Versus TDR A Systematic Review and Quantitative Analysis [J] . Katsuura Yoshihiro, York Philip J., Goto Rie, Spine . 2019,第19期

机译：使用传统ACDF的矢状重建和临床结果，与独立的ACDF与TDR进行了系统审查和定量分析
2. Behavioral analysis of differential hebbian learning in closed-loop systems [J] . Kulvicius T., Kolodziejski C., Tamosiunaite M., Biological Cybernetics: Communication and Control in Organisms and Automata: = Nachrichtenubertragung, Nachrichtenverarbeitung, Steuerung und Regelung in Organismen und in Automaten . 2010,第4期

机译：闭环系统中差分hebbian学习的行为分析
3. Behavioral analysis of differential hebbian learning in closed-loop systems [J] . Tomas Kulvicius, Christoph Kolodziejski, Minija Tamosiunaite, Biological Cybernetics . 2010,第4期

机译：闭环系统中差分hebbian学习的行为分析
4. Hebbian Versus Gradient Training of ESN Actors in Closed-Loop ACD [C] . Petia Koprinkova-Hristova International Conference on Numerical Methods and Applications . 2015

机译：Hebbian与闭环ACD中ESN演员的渐变训练
5. THE EFFECT OF BIOFEEDBACK AND RELAXATION TRAINING ON STUDENTS STUDYING VOICE IN AN ACTOR TRAINING PROGRAM: A PRELIMINARY STUDY. [D] . LOFT, MARGARET. 1987

机译：生物反馈和放松训练对在角色训练课程中学习语音的学生的影响：一项初步研究。
6. P16-35. Specific CD4 responses to HIV-1 epitopes in exposed seronegative (ESN) versus infected commercial sex workers [O] . MW Kiguoya, J Mwanjewe, B Ball, 2009

机译：P16-35。暴露的血清阴性（ESN）与受感染的商业性工作者对HIV-1表位的特异性CD4反应
7. Behavioral analysis of differential hebbian learning in closed-loop systems [O] . Tomas Kulvicius, Christoph Kolodziejski, Minija Tamosiunaite, 2010

机译：闭环系统中差分hebbian学习的行为分析

Hebbian Versus Gradient Training of ESN Actors in Closed-Loop ACD

摘要

著录项

相似文献

相关主题

期刊订阅