首页> 外文期刊>SIAM Journal on Control and Optimization >AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES
【24h】

AVERAGE CONTINUOUS CONTROL OF PIECEWISE DETERMINISTIC MARKOV PROCESSES

机译:确定性马尔可夫过程的平均连续控制

获取原文
获取原文并翻译 | 示例
           

摘要

This paper deals with the long run average continuous control problem of piecewise deterministic Markov processes (PDMPs) taking values in a general Borel space and with compact action space depending on the state variable. The control variable acts on the jump rate and transition measure of the PDMP, and the running and boundary costs are assumed to be positive but not necessarily bounded. Our first main result is to obtain an optimality equation for the long run average cost in terms of a discrete-time optimality equation related to the embedded Markov chain given by the postjump location of the PDMP. Our second main result guarantees the existence of a feedback measurable selector for the discrete-time optimality equation by establishing a connection between this equation and an integro-differential equation. Our final main result is to obtain some sufficient conditions for the existence of a solution for a discrete-time optimality inequality and an ordinary optimal feedback control for the long run average cost using the so-called vanishing discount approach. Two examples are presented illustrating the possible applications of the results developed in the paper.
机译:本文讨论了分段确定性马尔可夫过程(PDMP)的长期平均连续控制问题,该过程采用一般Borel空间中的值,而紧凑状态空间取决于状态变量。控制变量作用于PDMP的跳变率和过渡度量,并且运行成本和边界成本假定为正,但不一定是有界的。我们的第一个主要结果是,根据与PDMP的跳后位置给出的嵌入式马尔可夫链相关的离散时间最优性方程,获得了长期平均成本的最优性方程。我们的第二个主要结果是通过在离散时间最优方程与积分微分方程之间建立联系来保证该离散最优方程具有反馈可测量选择器。我们的最终主要结果是,使用所谓的消失贴现法,为离散时间最优性不等式的解决方案和长期平均成本的普通最优反馈控制的存在获得充分的条件。给出了两个例子,说明了本文得出的结果的可能应用。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号