A Continuous-time Markov Decision Process Based Method on Pursuit-Evasion Problem

Abstract

This paper presents a method for the pursuit-evasion problem that incorporates the behavior of the opponent. The method introduces a continuous-time Markov decision process (CTMDP) model, which differs from a standard Markov decision process (MDP) chiefly in that it takes the transition time between states into account. By introducing the concept of a situation, probabilities that characterize average behaviors are obtained, and these probabilities are then used to construct the transition matrix of the CTMDP. A policy iteration method for solving the CTMDP is also given. To demonstrate the CTMDP method for pursuit-evasion, examples in a grid environment are computed. The CTMDP-based method offers a new approach to pursuit-evasion modeling and may be extended to similar sequential decision problems.
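As a rough illustration of what policy iteration on a CTMDP can look like, the sketch below uses the textbook discounted-reward formulation: evaluate a policy by solving (alpha*I - Q_pi) v = r_pi, then improve it greedily with respect to r(s, a) + sum over s' of q(s' | s, a) v(s'). This is not taken from the paper, which may use a different criterion or construction. The function name `policy_iteration_ctmdp`, the discount rate `alpha`, the array layout, and the two-state toy example ("pursuing" and "captured") are all assumptions made for illustration.

```python
import numpy as np


def policy_iteration_ctmdp(Q, r, alpha, max_iter=1000):
    """Policy iteration for a finite, discounted CTMDP (illustrative sketch).

    Q     : array (A, S, S); Q[a] is the transition rate matrix under action a
            (off-diagonal entries >= 0, each row sums to zero).
    r     : array (S, A); reward rate earned per unit time in state s under a.
    alpha : discount rate, assumed > 0.
    """
    n_actions, n_states, _ = Q.shape
    states = np.arange(n_states)
    policy = np.zeros(n_states, dtype=int)
    v = np.zeros(n_states)
    for _ in range(max_iter):
        # Policy evaluation: solve (alpha*I - Q_pi) v = r_pi.
        Q_pi = Q[policy, states, :]          # row s comes from Q[policy[s]]
        r_pi = r[states, policy]
        v = np.linalg.solve(alpha * np.eye(n_states) - Q_pi, r_pi)

        # Policy improvement: gain[s, a] = r[s, a] + sum_t Q[a, s, t] * v[t].
        gain = r + np.einsum("ast,t->sa", Q, v)
        best = gain.max(axis=1)
        # Switch only on strict improvement so ties cannot cause cycling.
        improve = best > gain[states, policy] + 1e-12
        if not improve.any():
            return policy, v
        policy = policy.copy()
        policy[improve] = gain.argmax(axis=1)[improve]
    return policy, v


# Hypothetical toy problem: state 0 = "pursuing", state 1 = "captured" (absorbing).
# Action 0 reaches capture at rate 1, action 1 at rate 3; pursuing costs 1 per unit time.
Q = np.array([
    [[-1.0, 1.0], [0.0, 0.0]],   # action 0
    [[-3.0, 3.0], [0.0, 0.0]],   # action 1
])
r = np.array([
    [-1.0, -1.0],                # state 0: time cost under either action
    [0.0, 0.0],                  # state 1: no further cost
])
policy, v = policy_iteration_ctmdp(Q, r, alpha=0.1)
print(policy, v)
```

In this toy case the faster action is preferred in the pursuing state, because the value of a state under a fixed action is -1 / (alpha + rate), which shrinks in magnitude as the capture rate grows. The dependence on the rate is exactly where the transition time between states enters, which a discrete-time MDP with the same jump probabilities would not capture.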

Bibliographic details

Source: IFAC World Congress, 2014, 6 pages
Authors: Jia Shengde; Wang Xiangke; Ji Xiaoting; Zhu Huayong
Original format: PDF
Chinese Library Classification: TP273-53
Keywords: Pursuit-Evasion; Continuous-time Markov Decision Process; Transition Rates Matrix; Dynamic Programming; Policy Iteration

Similar literature

1. A Continuous-Time Markov Decision Process-Based Method With Application in a Pursuit-Evasion Example [J]. Shengde Jia, Xiangke Wang, Lincheng Shen. IEEE Transactions on Systems, Man, and Cybernetics. 2016, No. 9
2. The Transformation Method for Continuous-Time Markov Decision Processes [J]. Piunovskiy A., Zhang Y. Journal of Optimization Theory and Applications. 2012, No. 2
3. The Transformation Method for Continuous-Time Markov Decision Processes [J]. Alexey Piunovskiy, Yi Zhang. Journal of Optimization Theory and Applications. 2012, No. 2
4. A Continuous-time Markov Decision Process Based Method on Pursuit-Evasion Problem [C]. Jia Shengde, Wang Xiangke, Ji Xiaoting. IFAC World Congress. 2014
5. Modern Methods of Hidden Markov Models and Partially Observable Markov Decision Processes in Biostatistics [D]. Xu, Zekun. 2020
6. Using model-based proposals for fast parameter inference on discrete state space continuous-time Markov processes [O]. C. M. Pooley, S. C. Bishop, G. Marion. 2015
7. Dynamic Power Management Based on Continuous-Time Markov Decision Processes [O]. Qinru Qiu, Massoud Pedram. 1999
