
Simple and Fast Calculation of the Second-Order Gradients for Globalized Dual Heuristic Dynamic Programming in Neural Networks



Abstract

We derive an algorithm that exactly calculates the mixed second-order derivatives of a neural network's output with respect to its input vector and weight vector, a quantity required by two adaptive dynamic programming (ADP) algorithms: globalized dual heuristic programming (GDHP) and value-gradient learning. The algorithm computes the inner product of this second-order matrix with a given fixed vector in time linear in the number of weights in the network. We use a "forward accumulation" of the derivative calculations, which yields a much more elegant and easier-to-implement solution than has previously been published for this task. The algorithm thus makes GDHP simple to implement and efficient, bridging the gap between the widely used DHP and GDHP ADP methods.
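To illustrate the quantity the abstract describes, the following NumPy sketch (not the paper's own pseudocode) carries a forward-mode tangent along a fixed direction v through both the forward and backward passes of a hypothetical one-layer network y = Σ tanh(Wx). The tangent of the weight gradient produced this way is exactly the product (∂²y/∂x∂W)·v, obtained in a single sweep whose cost is linear in the number of weights:

```python
import numpy as np

rng = np.random.default_rng(0)
W = rng.standard_normal((3, 4))   # weights of a tiny example network
x = rng.standard_normal(4)        # input vector
v = rng.standard_normal(4)        # fixed direction vector

# Forward pass, carrying the tangent (directional derivative along v)
z = W @ x                  # pre-activations
dz = W @ v                 # tangent of z
a = np.tanh(z)
da = (1 - a**2) * dz       # tangent of a

# Backward pass for dy/dW (y = a.sum()), tangents propagated in the same sweep
g = 1 - a**2               # dy/dz
dg = -2 * a * da           # tangent of dy/dz
dy_dW = np.outer(g, x)     # first-order gradient dy/dW
mixed = np.outer(dg, x) + np.outer(g, v)   # exact (d2y/dx dW) . v

# Verification against the explicit mixed-derivative tensor
# H[i, j, k] = d2y / dW_ij dx_k = -2 a_i g_i W_ik x_j + g_i * (j == k)
H = (-2 * a * g)[:, None, None] * W[:, None, :] * x[None, :, None] \
    + g[:, None, None] * np.eye(4)[None, :, :]
assert np.allclose(H @ v, mixed)
```

Forming H explicitly costs memory and time quadratic in the problem size; the point of the forward-accumulation pass is that `mixed` is obtained without ever materializing H.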


