AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES

Costa OLV; Dufour F

Abstract

This paper deals with the long run average continuous control problem of piecewise deterministic Markov processes (PDMPs) taking values in a general Borel space and with compact action space depending on the state variable. The control variable acts on the jump rate and transition measure of the PDMP, and the running and boundary costs are assumed to be positive but not necessarily bounded. Our first main result is to obtain an optimality equation for the long run average cost in terms of a discrete-time optimality equation related to the embedded Markov chain given by the postjump location of the PDMP. Our second main result guarantees the existence of a feedback measurable selector for the discrete-time optimality equation by establishing a connection between this equation and an integro-differential equation. Our final main result is to obtain some sufficient conditions for the existence of a solution for a discrete-time optimality inequality and an ordinary optimal feedback control for the long run average cost using the so-called vanishing discount approach. Two examples are presented illustrating the possible applications of the results developed in the paper.
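To make the notion of a long run average cost concrete, the following is a minimal sketch on a toy PDMP, not the model or method of the paper. Every modelling choice here is an assumption made for illustration: the state grows at unit speed between jumps, each jump resets it to 0, the control is a constant jump rate chosen from a compact interval, the running cost is the state plus a penalty proportional to the rate, and there is no boundary cost. The function names `simulate_average_cost`, `exact_average_cost`, and `best_rate_on_grid` are hypothetical. The simulation accumulates cost one inter-jump interval at a time, which mirrors the idea of working with the chain of postjump locations.

```python
import random


def simulate_average_cost(jump_rate, rate_cost, n_jumps=50_000, seed=0):
    """Estimate the long run average cost of a toy PDMP, jump by jump.

    Toy model (an assumption for illustration, not taken from the paper):
      - between jumps the state is x(s) = s, the time since the last jump
      - jumps occur at the constant rate `jump_rate` and reset x to 0
      - the running cost per unit time is x + rate_cost * jump_rate
    """
    rng = random.Random(seed)
    total_cost = 0.0
    total_time = 0.0
    for _ in range(n_jumps):
        # Sojourn time until the next jump, exponential with the chosen rate.
        t = rng.expovariate(jump_rate)
        # Integral of (s + rate_cost * jump_rate) for s in [0, t].
        total_cost += 0.5 * t * t + rate_cost * jump_rate * t
        total_time += t
    return total_cost / total_time


def exact_average_cost(jump_rate, rate_cost):
    """Closed form for the toy model: E[T^2 / 2] / E[T] + c * a = 1 / a + c * a."""
    return 1.0 / jump_rate + rate_cost * jump_rate


def best_rate_on_grid(grid, rate_cost):
    """Pick the constant jump rate on a finite grid with the lowest average cost."""
    return min(grid, key=lambda a: exact_average_cost(a, rate_cost))
```

With `rate_cost = 0.5`, the simulated estimate for `jump_rate = 1.0` should be close to the closed-form value 1.5, and a grid search over `[0.5, 1.0, 1.5, 2.0]` selects 1.5. The paper's setting is far more general (state-dependent feedback controls, general Borel state space, unbounded costs), so this only illustrates the cost criterion itself.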

Bibliographic details

Source: SIAM Journal on Control and Optimization, 2010, No. 8, 30 pages
Authors: Costa OLV; Dufour F
Original format: PDF
Language: English
Chinese Library Classification: Operations research
Keywords: piecewise deterministic Markov process; continuous-time; long run; average cost; optimal control; integro-differential optimality equation; vanishing discount approach
