【24h】

SYMBOLIC DYNAMIC PROGRAMMING WITHIN THE FLUENT CALCULUS

机译:流质计算中的符号动态编程

获取原文
获取原文并翻译 | 示例

摘要

A symbolic dynamic programming approach for modelling first-order Markov decision processes within the fluent calculus is given. Based on an idea initially presented in [3], the major components of Markov decision processes such as the optimal value function and a policy are logically represented. The technique produces a set of first-order formulae with equality that minimally partitions the state space. Consequently, the symbolic dynamic programming algorithm presented here does not require to enumerate the state and action spaces, thereby solving a drawback of classical dynamic programming methods. In addition, we illustrate how conditional actions and specificity can be modelled by the approach.
机译:给出了一种在动态演算中建模一阶马尔可夫决策过程的符号动态规划方法。基于最初在[3]中提出的想法,在逻辑上表示了马尔可夫决策过程的主要组成部分,例如最优价值函数和策略。该技术生成具有相等性的一组一阶公式,以最小化状态空间的划分。因此,这里提出的符号动态编程算法不需要枚举状态和动作空间,从而解决了传统动态编程方法的缺点。此外,我们说明了该方法如何模拟条件行为和特异性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号