首页> 美国政府科技报告 >Kernel-Based Approximate Dynamic Programming Using Bellman Residual Elimination

【24h】

Kernel-Based Approximate Dynamic Programming Using Bellman Residual Elimination

机译：基于贝尔曼残差消除的核基近似动态规划

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Many sequential decision-making problems related to multi-agent robotic systems can be naturally posed as Markov Decision Processes (MDPs). An important advantage of the MDP framework is the ability to utilize stochastic system models, thereby allowing the system to make sound decisions even if there is randomness in the system evolution over time. Unfortunately, the curse of dimensionality prevents most MDPs of practical size from being solved exactly. One main focus of the thesis is on the development of a new family of algorithms for computing approximate solutions to large-scale MDPs. Our algorithms are similar in spirit to Bellman residual methods, which attempt to minimize the error incurred in solving Bellman's equation at a set of sample states. However, by exploiting kernel-based regression techniques (such as support vector regression and Gaussian process regression) with nondegenerate kernel functions as the underlying cost-to-go function approximation architecture, our algorithms are able to construct cost-to-go solutions for which the Bellman residuals are explicitly forced to zero at the sample states. For this reason, we have named our approach Bellman residual elimination (BRE). In addition to developing the basic ideas behind BRE, we present multi-stage and model-free extensions to the approach. The multistage extension allows for automatic selection of an appropriate kernel for the MDP at hand, while the model-free extension can use simulated or real state trajectory data to learn an approximate policy when a system model is unavailable.

著录项

作者
Bethke, B. M.;
展开▼
作者单位

展开▼
年度 2010
页码 p.1-222
总页数 222
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Markov processes ; Decision making ; Theses ; Dynamic programming ; Elimination ; Algorithms ; Kernel functions;

机译：马尔可夫过程;决策;论文;动态规划;消除;算法;核函数;

相似文献

外文文献
中文文献
专利

1. 基于贝尔曼动态规划的服务恢复决策方法 [J] . 何蕾, 任江春, 王志英东南大学学报（英文版） . 2008,第003期
2. Approximate dynamic programming via iterated Bellman inequalities [J] . Wang Yang, ODonoghue Brendan, Boyd Stephen International Journal of Robust and Nonlinear Control . 2015,第10期

机译：通过反复的Bellman不等式进行近似动态编程
3. Hamilton–Jacobi–Bellman Equations and Approximate Dynamic Programming on Time Scales [J] . Seiffertt J., Sanyal S., Wunsch D. C. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics . 2008,第4期

机译：Hamilton-Jacobi-Bellman方程和时标上的近似动态规划
4. Hamilton;Jacobi;Bellman Equations and Approximate Dynamic Programming on Time Scales [J] . Seiffertt J., Sanyal S., Wunsch D.C. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics . 2008,第4期

机译：Hamilton; Jacobi; Bellman方程和时标上的近似动态规划
5. Approximate dynamic programming using model-free Bellman Residual Elimination [C] . Bethke Brett, How Jonathan P. 2010 American Control Conference . 2010

机译：使用无模型Bellman残差消除的近似动态编程
6. On Numerical Stochastic Optimal Control via Bellman's Dynamic Programming Principle [D] . Aboagye, Prince Osei. 2018

机译：通过Bellman动态规划原理的数值随机最佳控制
7. Bellman’s GAP—a language and compiler for dynamic programming in sequence analysis [O] . Georg Sauthoff, Mathias Möhl, Stefan Janssen, -1

机译：Bellman的GAP-一种用于序列分析中动态编程的语言和编译器
8. Kernel-based approximate dynamic programming using Bellman residual elimination [O] . Bethke Brett (Brett M.) 2010

机译：基于核的近似动态规划使用Bellman残差消除

Kernel-Based Approximate Dynamic Programming Using Bellman Residual Elimination

摘要

著录项

相似文献

相关主题

期刊订阅