...
首页> 外文期刊>Stochastics: An International Journal of Probability and Stochastic Processes >Finite-horizon piecewise deterministic Markov decision processes with unbounded transition rates
【24h】

Finite-horizon piecewise deterministic Markov decision processes with unbounded transition rates

机译:有限地平线分段确定型马尔可夫决策过程,具有无限的过渡率

获取原文
获取原文并翻译 | 示例
           

摘要

This paper is concerned with the problem of minimizing the expected finite-horizon cost for piecewise deterministic Markov decision processes. The transition rates may be unbounded, and the cost functions are allowed to be unbounded from above and from below. The optimality is over the general history-dependent policies, where the control is continuously acting in time. The infinitesimal approach is employed to establish the associated Hamilton-Jacobi-Bellman equation, via which the existence of optimal policies is proved. An example is provided to verify all the assumptions proposed.
机译:本文涉及最小化分段确定性马尔可夫决策过程的预期有限范围成本的问题。 过渡率可能是无界的,并且允许成本函数从上方和下面的下方无限制。 最优性是依赖于一般历史依赖的政策,其中控制在时间不断行动。 使用无限的方法来建立相关的Hamilton-Jacobi-Bellman方程,证明了最佳政策的存在。 提供一个例子以验证所提出的所有假设。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号