Collection: US Government Science and Technology Reports

Using Strong Convergence to Accelerate Value Iteration.



Abstract

Convergence of the relative value function (the total value function less the total value of a base state) and of the optimal policy, as the horizon length grows in value iteration (Markov decision programming), has recently been shown to be geometric with factor alpha*beta, where alpha is the discount factor and beta <= 1.0. The case beta < 1.0 is termed 'strong convergence'. This paper suggests estimating bounds on the convergence rate computationally during the value iteration process, yielding bounds directly on the extrapolated infinite-horizon relative value function. Such an extrapolation serves two purposes. First, large numbers of value iterations can be skipped by continuing computation directly with the estimated infinite-horizon relative value function (directly analogous to quadratic acceleration procedures in nonlinear programming). Second, existing procedures for eliminating non-optimal actions are greatly strengthened, since an action can be eliminated permanently once the bounds on the infinite-horizon relative value function become sufficiently tight.

