...
首页> 外文期刊>Mathematics of operations research >Multiconstrained Finite-Horizon Piecewise Deterministic Markov Decision Processes with Unbounded Transition Rates
【24h】

Multiconstrained Finite-Horizon Piecewise Deterministic Markov Decision Processes with Unbounded Transition Rates

机译:多元素有限地平线分段确定型马尔可夫决策过程,具有无限的过渡率

获取原文
获取原文并翻译 | 示例
           

摘要

This paper studies a multiconstrained problem for piecewise deterministic Markov decision processes (PDMDPs) with unbounded cost and transition rates. The goal is to minimize one type of expected finite-horizon cost over history-dependent policies while keeping some other types of expected finite-horizon costs lower than some tolerable bounds. Using the Dynkin formula for the PDMDPs, we obtain an equivalent characterization of occupancy measures and express the expected finite-horizon costs in terms of occupancy measures. Under suitable assumptions, the existence of constrained-optimal policies is shown, the linear programming formulation and its dual program for the constrained problem are derived, and the strong duality between the two programs is established. An example is provided to demonstrate our results.
机译:本文研究了具有无限成本和转换率的分段确定性马尔可夫决策过程(PDMDP)的多元统一问题。 目标是最大限度地减少一种类型的预期有限范围成本,而不是历史依赖性政策,同时保持其他一些类型的预期有限范围成本低于某种可容忍的范围。 使用PDMDP的Dynkin公式,我们获得了占用率的等效表征,并在入住措施方面表达了预期的有限范围成本。 在合适的假设下,示出了受约束的最佳策略的存在,导出了线性编程配方及其用于约束问题的双程程序,并且建立了两个程序之间的强不良性。 提供了一个例子来展示我们的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号