首页> 外文会议>IEEE Annual Conference on Decision and Control >Topological Approximate Dynamic Programming under Temporal Logic Constraints
【24h】

Topological Approximate Dynamic Programming under Temporal Logic Constraints

机译:拓扑近似动态规划在时间逻辑约束下

获取原文

摘要

In this paper, we develop a model-free approximate dynamic programming method for stochastic systems modeled as Markov decision processes to maximize the probability of satisfying high-level system specifications expressed in a subclass of temporal logic formulas-syntactically co-safe linear temporal logic. Our proposed method includes two steps: First, we decompose the planning problem into a sequence of sub-problems based on the topological property of the task automaton which is translated from a temporal logic formula. Second, we extend a model-free approximate dynamic programming method to solve value functions, one for each state in the task automaton, in an order reverse to the causal dependency. Particularly, we show that the run-time of the proposed algorithm does not grow exponentially with the size of specifications. The correctness and efficiency of the algorithm are demonstrated using a robotic motion planning example.
机译:在本文中,我们开发了一种无模型近似动态编程方法,用于为马尔可夫决策过程建模的随机系统,以最大化满足时间逻辑公式 - 句法共同安全线性时间逻辑的子类中表达的高级系统规范的概率。我们所提出的方法包括两个步骤:首先,我们将规划问题分解为基于从时间逻辑公式转换的任务自动机的拓扑属性的子问题序列。其次,我们扩展了一种无模型近似动态编程方法来解决值函数,一个用于任务自动机中的每个状态的值,以反向对因果依赖性的顺序。特别是,我们表明所提出的算法的运行时间与规格的大小呈指数级增长。使用机器人运动规划示例对算法的正确性和效率进行了说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号