Conference: International Conference on Logic Programming and Nonmonotonic Reasoning

Elaboration Tolerant Representation of Markov Decision Process via Decision-Theoretic Extension of Probabilistic Action Language pBC+


Abstract

We extend the probabilistic action language pBC+ with the notion of utility from decision theory. The semantics of the extended pBC+ can be defined as a shorthand notation for a decision-theoretic extension of the probabilistic answer set programming language LP^MLN. Alternatively, the semantics of pBC+ can be defined in terms of a Markov Decision Process (MDP), which in turn allows for representing MDPs in a succinct and elaboration tolerant way as well as leveraging an MDP solver to compute a pBC+ action description. This idea led to the design of the system pbcplus2mdp, which can find an optimal policy of a pBC+ action description using an MDP solver.
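
To make "finding an optimal policy with an MDP solver" concrete, here is a minimal value-iteration sketch in Python. It is not the pbcplus2mdp implementation and does not use pBC+ syntax. The two-state MDP, its state and action names, the rewards, and the discount factor are all hypothetical values chosen for illustration only.

```python
# Illustrative only: a generic value-iteration solver on a made-up MDP.
# Nothing here is taken from pbcplus2mdp; all names and numbers are assumptions.

def q_value(state, action, values, transitions, gamma):
    """Expected discounted return of taking `action` in `state`."""
    return sum(
        prob * (reward + gamma * values[next_state])
        for prob, next_state, reward in transitions[(state, action)]
    )


def value_iteration(states, actions, transitions, gamma=0.9, tol=1e-9):
    """Return (values, policy) for a finite MDP.

    transitions[(s, a)] is a list of (probability, next_state, reward).
    """
    values = {s: 0.0 for s in states}
    while True:
        delta = 0.0
        for s in states:
            best = max(q_value(s, a, values, transitions, gamma) for a in actions)
            delta = max(delta, abs(best - values[s]))
            values[s] = best
        if delta < tol:
            break
    # Greedy policy with respect to the converged value function.
    policy = {
        s: max(actions, key=lambda a: q_value(s, a, values, transitions, gamma))
        for s in states
    }
    return values, policy


# Hypothetical toy problem: "work" costs 1 but may move the agent to "high",
# where "wait" pays 2 per step.
states = ["low", "high"]
actions = ["wait", "work"]
transitions = {
    ("low", "wait"): [(1.0, "low", 0.0)],
    ("low", "work"): [(0.8, "high", -1.0), (0.2, "low", -1.0)],
    ("high", "wait"): [(1.0, "high", 2.0)],
    ("high", "work"): [(1.0, "high", 1.0)],
}

values, policy = value_iteration(states, actions, transitions)
print(policy)
```

With these made-up numbers the greedy policy chooses "work" in the "low" state and "wait" in the "high" state. In the setting the abstract describes, the states, actions, transition probabilities, and utilities would be derived from a pBC+ action description rather than written out by hand.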
