A class of two-dimensional stochastic approximations and steering policies for Markov decision processes


Abstract

The authors consider a specific multidimensional stochastic approximation scheme of the Robbins-Monro type that arises naturally in the study of steering policies for Markov decision processes. The usual almost sure convergence results do not seem to apply to this simple scheme. Almost sure convergence is instead established by an indirect argument that combines standard results on stochastic approximations with a version of the law of large numbers for martingale differences. These convergence properties provide an alternative proof of some of the properties of steering policies.
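As a rough illustration of what a Robbins-Monro type scheme looks like, the sketch below runs a generic one-dimensional iteration x_{n+1} = x_n - a_n * Y_n with step sizes a_n = 1/n, where Y_n is a noisy observation of a function whose root is sought. This is not the two-dimensional scheme studied in the paper, which the abstract does not specify. The function names, the linear test function, the Gaussian noise model, and all parameter values are assumptions chosen only for illustration.

```python
import random


def robbins_monro(noisy_f, x0, n_steps):
    """Run x_{n+1} = x_n - a_n * Y_n with a_n = 1/n, where Y_n = noisy_f(x_n)."""
    x = x0
    for n in range(1, n_steps + 1):
        a_n = 1.0 / n  # steps sum to infinity, squared steps sum to a finite value
        x -= a_n * noisy_f(x)
    return x


def make_noisy_f(theta, sigma, rng):
    """Hypothetical test problem: f(x) = x - theta observed with Gaussian noise."""
    def noisy_f(x):
        return (x - theta) + rng.gauss(0.0, sigma)
    return noisy_f


rng = random.Random(0)  # fixed seed so the run is reproducible
estimate = robbins_monro(make_noisy_f(2.0, 1.0, rng), x0=10.0, n_steps=10_000)
print(estimate)  # should be close to the root theta = 2.0
```

For this particular linear test function and step size, the iterate after n steps equals theta minus the average of the first n noise terms, so its convergence is exactly a law of large numbers statement. That gives some intuition for why such a law can enter a convergence argument of the kind the abstract describes.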


Bibliographic record

Source: 1992, pp. 3344-3349 (6 pages)
Authors: Ma, D.-J.; Makowski, A.M.
Original format: PDF
Chinese Library Classification: Radio electronics, telecommunications technology

Similar literature

1. Stochastic approximations of constrained discounted Markov decision processes [J]. François Dufour, Tomás Prieto-Rumeau. Journal of Mathematical Analysis and Applications. 2014, No. 2
2. On discounted approximations of undiscounted stochastic games and Markov decision processes with limited randomness [J]. Boros E., Elbassioni K., Gurvich V. Operations Research Letters: A Journal of the Operations Research Society of America. 2013, No. 4
3. Stochastic Approximation for Risk-Aware Markov Decision Processes [J]. Huang Wenjie, Haskell William B. IEEE Transactions on Automatic Control. 2021, No. 3
4. A class of two-dimensional stochastic approximations and steering policies for Markov decision processes [C]. Ma D.-J., Makowski A.M. Institute of Electrical and Electronics Engineers, IEEE Conference on Decision and Control. 1992
5. Linear approximations for factored Markov decision processes [D]. Patrascu, Relu-Eugen. 2005
6. Evolving Robust Policy Coverage Sets in Multi-Objective Markov Decision Processes Through Intrinsically Motivated Self-Play [O]. Sherif Abdelfattah, Kathryn Kasmarik, Jiankun Hu. 2018
7. Finite state approximations for denumerable-state infinite horizon contracted Markov decision processes: The policy space method [O]. White D.J. 1979
8. Steering Policies for Markov Decision Processes Under a Recurrence Condition [R]. Ma, D., Makowski, A. M. 1988
