Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

Onesimo Hernandez-Lerma; Jean B. Lasserre

首页> 外文期刊>Acta Applicandae Mathematicae: An International Journal on Applying Mathematics and Mathematical Applications >Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

【24h】

Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

机译：Borel空间上平均成本马尔可夫控制过程的策略迭代

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

This paper studies the policy iteration algorithm (PIA) for average cost Markov control processes on Borel spaces. Two classes of MCPs are considered. One of them allows some restricted-growth unbounded cost functions and compact control constraint sets; the other one requires strictly unbounded costs and the control constraint sets may be non-compact. For each of these classes, the PIA yields, under suitable assumptions, the optimal (minimum) cost, an optimal stationary control policy, and a solution to the average cost optimality equation.

机译：本文研究了Borel空间上平均成本Markov控制过程的策略迭代算法（PIA）。考虑了两类MCP。其中之一允许一些限制增长的无穷成本函数和紧凑的控制约束集;另一种则要求严格的无限制成本，并且控制约束集可能不紧凑。对于这些类别中的每一个类别，PIA都会在适当的假设下产生最佳（最小）成本，最佳固定控制策略以及平均成本最优方程的解。

著录项

来源
《Acta Applicandae Mathematicae: An International Journal on Applying Mathematics and Mathematical Applications》 |1997年第2期|共30页
作者
Onesimo Hernandez-Lerma; Jean B. Lasserre;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类应用数学;
关键词
(discrete-time) markov control processes; average cost; policy iteration (a.k.a. Howard's algorithm);

机译：（离散）马尔可夫控制过程;平均成本;政策迭代（又称霍华德算法）;
入库时间 2022-08-18 09:59:03

相似文献

外文文献
中文文献
专利

1. 基于策略迭代的空间系绳载荷捕获自适应最优控制 [J] . 冯毅庭, 张鸣, 郭闻昊, 南京航空航天大学学报（英文版） . 2021,第004期
2. Policy Iteration for Average Cost Markov Control Processes on Borel Spaces [J] . Onesimo Hernandez-Lerma, Jean B. Lasserre Acta Applicandae Mathematicae: An International Journal on Applying Mathematics and Mathematical Applications . 1997,第2期

机译：Borel空间上平均成本马尔可夫控制过程的策略迭代
3. A PERTURBATION APPROACH TO APPROXIMATE VALUE ITERATION FOR AVERAGE COST MARKOV DECISION PROCESSES WITH BOREL SPACES AND BOUNDED COSTS [J] . Vega-Amaya Oscar, Lopez-Borbon Joaqun Kybernetika . 2019,第1期

机译：具有BOREL空间和绑定成本的平均成本MARKOV决策过程的近似值迭代的扰动方法
4. A PERTURBATION APPROACH TO APPROXIMATE VALUE ITERATION FOR AVERAGE COST MARKOV DECISION PROCESSES WITH BOREL SPACES AND BOUNDED COSTS [J] . Vega-Amaya Oscar, Lopez-Borbon Joaqun Kybernetika . 2019,第1期

机译：具有Borel空间和界限成本的平均成本马尔可夫决策过程近似值迭代的扰动方法
5. Finite-state approximation of Markov decision processes with unbounded costs and Borel spaces [C] . Naci Saldi, Serdar Yuksel, Tamas Linder IEEE Conference on Decision and Control . 2015

机译：具有无限成本和Borel空间的Markov决策过程的有限状态近似
6. A Markovian Optimization Model for Pavement Maintenance Using Policy Iteration Algorithm with Discounted Road-user and Agency Costs [D] . Narh-Dometey, Anita. 2019

机译：利用折扣道路用户和机构成本的策略迭代算法的路面维护马尔瓦维亚优化模型
7. Approximation methods for piecewise deterministic Markov processes and their costs [O] . Peter Kritzer, Gunther Leobacher, Michaela Szölgyenyi, -1

机译：分段确定性马尔可夫过程的逼近方法及其成本
8. A perturbation approach to approximate value iteration for average cost Markov decision processes with Borel spaces and bounded costs [O] . Óscar Vega-Amaya, Joaquín López-Borbón 2019

机译：具有Borel空间和界限成本的平均成本马尔可夫决策过程近似值迭代的扰动方法
9. Discrete-Time Controlled Markov Processes With Average Cost Criterion: A Survey. [R] . Arapostathis, A., Borkar, V. S., Fernandez- Gaucherand, E., 1992

机译：具有平均成本标准的离散时间控制马尔可夫过程：一项调查。

Policy Iteration for Average Cost Markov Control Processes on Borel Spaces

摘要

著录项

相似文献

相关主题

期刊订阅