Home > Foreign Journals > IEEE Transactions on Neural Networks and Learning Systems > Simple and Fast Calculation of the Second-Order Gradients for Globalized Dual Heuristic Dynamic Programming in Neural Networks

Simple and Fast Calculation of the Second-Order Gradients for Globalized Dual Heuristic Dynamic Programming in Neural Networks


Abstract

We derive an algorithm that exactly calculates the mixed second-order derivatives of a neural network's output with respect to its input vector and weight vector. These derivatives are required by the adaptive dynamic programming (ADP) algorithms known as globalized dual heuristic programming (GDHP) and value-gradient learning. The algorithm computes the inner product of this second-order matrix with a given fixed vector in time linear in the number of network weights. We use "forward accumulation" of the derivative calculations, which yields a solution far more elegant and easier to implement than those previously published for this task. In doing so, the algorithm makes GDHP simple to implement and efficient, bridging the gap between the widely used DHP and GDHP ADP methods.
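The quantity described above, the product of a fixed vector with the mixed second-order derivative matrix of the network output with respect to weights and inputs, can be illustrated with forward-mode automatic differentiation. The sketch below is not the paper's derivation; it is a minimal JAX analogue, with a hypothetical two-layer network `f`, that obtains the same kind of directional second derivative in one linear-time forward pass (a JVP) through the input-gradient function:

```python
import jax
import jax.numpy as jnp

# Hypothetical small network: scalar output f(params, x).
def f(params, x):
    W1, b1, w2 = params
    h = jnp.tanh(W1 @ x + b1)
    return jnp.dot(w2, h)

# Gradient of the output with respect to the input x.
g = jax.grad(f, argnums=1)

# Forward accumulation (a JVP) through g in the weight direction v
# yields the inner product of the mixed second-derivative matrix
# d^2 f / (dw dx) with v, without forming the matrix itself.
def mixed_second_order_product(params, x, v):
    _, tangent = jax.jvp(lambda p: g(p, x), (params,), (v,))
    return tangent

params = (jnp.array([[0.1, -0.2], [0.3, 0.05]]),
          jnp.array([0.0, 0.1]),
          jnp.array([0.5, -0.4]))
x = jnp.array([0.2, -0.1])
v = tuple(0.01 * jnp.ones_like(p) for p in params)
result = mixed_second_order_product(params, x, v)
```

The cost is one extra forward sweep over the gradient computation, so it scales linearly with the number of weights, which is the efficiency property the abstract claims for the GDHP setting.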

