首页> 外文会议>American Conference on Applied Mathematics >A Lyapunov Shortest-Path Characterization for Markov Decision Processes
【24h】

A Lyapunov Shortest-Path Characterization for Markov Decision Processes

机译:Markov决策过程的Lyapunov最短路径表征

获取原文

摘要

In this paper we introduce a modeling paradigm for developing decision process representation for shortest-path problems. Whereas, in previous work attention was restricted to tracking the net using Bellman's equation as a utility function, this work uses a Lyapunov-like function. In this sense, we are changing the traditional cost function by a trajectory-tracking function which is also an optimal cost-to-target function for tracking the net. The main point of the Markov decision process is its ability to represent the system-dynamic and trajectory-dynamic properties of a decision process. Within the system-dynamic properties framework we prove new notions of equilibrium and stability. In the trajectory-dynamic properties framework, we optimize the value of the trajectory-function used for path planning via a Lyapunov-like function, obtaining as a result new characterizations for final decision points (optimum points) and stability. Moreover, we show that the system-dynamic and Lyapunov trajectory-dynamic properties of equilibrium, stability and final decision points (optimum points) meet under certain restrictions.
机译:在本文中,我们介绍了用于开发决策过程表示以获得最短路径问题的建模范式。虽然,在以前的工作中,仅限于跟踪使用Bellman的方程作为实用程序函数跟踪网络,但这项工作使用Lyapunov样功能。从这个意义上讲,我们正在通过轨迹跟踪功能改变传统的成本函数,这也是跟踪网络的最佳成本函数。马尔可夫决策过程的要点是它代表决策过程的系统动态和轨迹动态属性的能力。在系统 - 动态属性框架内,我们证明了新的均衡和稳定性概念。在轨迹 - 动态属性框架中,我们通过Lyapunov样功能优化用于路径规划的轨迹函数的值,从而获得最终决策点(最佳点)和稳定性的新特征。此外,我们表明系统动态和Lyapunov轨迹 - 平衡,稳定性和最终决策点(最佳点)的动态性质在某些限制下满足。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号