Conference proceedings: Fuzzy Systems and Knowledge Discovery; Lecture Notes in Artificial Intelligence; 4223

On the Markovian Randomized Strategy of Controller for Markov Decision Processes


Abstract

This paper focuses on the so-called controller synthesis problem: how to restrict the internal behavior of a given system implementation so that it meets its specification, regardless of the behavior enforced by the environment. We consider this problem in the probabilistic setting, where the underlying model exhibits both probabilistic and nondeterministic behavior; the nondeterministic choices in some states are assumed to be controllable, while those in the remaining states are under the control of an unpredictable environment. The specification is given in probabilistic computation tree logic (PCTL). We show that, when the controller is restricted to Markovian randomized strategies, the existence of such a controller is decidable, by a reduction to the decidability of the first-order theory of the reals. The reduction also yields an algorithm that synthesizes the controller whenever one exists.
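
To illustrate why a reduction to real arithmetic is plausible, here is a minimal sketch on a hypothetical one-state example; it is not the paper's construction, and it omits the environment entirely. Assume a controllable state s0 with two actions: action a reaches the goal with probability 1/2 and returns to s0 otherwise, while action b reaches the goal with probability 1/5 and fails otherwise. A Markovian randomized strategy is then a single number p in [0, 1], the probability of choosing a in s0. The probability x of eventually reaching the goal satisfies x = p(1/2 + x/2) + (1 - p)/5, so x is a rational function of p, and asking whether some strategy meets a threshold such as 3/5 amounts to the truth of a first-order sentence over the reals: there exists p with 0 <= p <= 1 and x(p) >= 3/5. All names and numbers below are assumptions chosen for illustration.

```python
from fractions import Fraction


def reach_probability(p):
    """Probability of eventually reaching the goal from s0 when the
    (hypothetical) strategy picks action a with probability p.

    Solves x = p*(1/2 + x/2) + (1 - p)*(1/5) for x, which gives
    x = (1/5 + 3p/10) / (1 - p/2).
    """
    p = Fraction(p)
    return (Fraction(1, 5) + Fraction(3, 10) * p) / (1 - Fraction(1, 2) * p)


def satisfies(p, threshold=Fraction(3, 5)):
    """Check whether strategy p is valid and meets the reachability threshold."""
    p = Fraction(p)
    return 0 <= p <= 1 and reach_probability(p) >= threshold


if __name__ == "__main__":
    for p in (Fraction(0), Fraction(1, 2), Fraction(2, 3), Fraction(1)):
        print(p, reach_probability(p), satisfies(p))
```

In this toy case the constraint simplifies by hand to p >= 2/3, so a controller exists. In general the constraints are polynomial in several strategy parameters, which is where a decision procedure for the first-order theory of the reals would come in.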
