A Lower Bounding Linear Programming approach to the Perimeter Patrol Stochastic Control Problem

Abstract

One encounters the curse of dimensionality when applying dynamic programming to determine optimal policies for large-scale controlled Markov chains. In this article, we consider a perimeter patrol stochastic optimal control problem. To determine the optimal control policy, one has to solve a Markov decision problem whose large size renders exact dynamic programming methods intractable. We therefore propose a state-aggregation-based approximate linear programming method to construct provably good sub-optimal policies instead. The state space is partitioned, and the optimal cost-to-go (value function) is restricted to be constant over each partition. We show that the resulting restricted system of linear inequalities embeds a family of Markov chains of lower dimension, one of which can be used to construct a tight lower bound on the optimal value function. In general, constructing the lower bound requires solving a combinatorial problem, but the perimeter patrol problem exhibits a special structure that enables a tractable linear programming formulation for the lower bound. We demonstrate this and also provide numerical results that corroborate the efficacy of the proposed methodology.
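
As an illustrative sketch only (not the authors' construction), the Python below applies the partition idea from the abstract to a made-up four-state, two-action problem, assuming a discounted cost-minimization setting. The value function is restricted to one constant per partition, and the aggregated operator takes the smallest one-step cost over every state in a partition; lifting its fixed point `w` back to the full state space gives values that never exceed the exact cost-to-go. The transition matrices `P`, costs `G`, discount factor `LAM`, and partition map `PART` are all assumed toy data, and a plain fixed-point iteration stands in for an LP solver.

```python
# Toy data (hypothetical, for illustration only).
LAM = 0.9                      # discount factor
PART = [0, 0, 1, 1]            # state x belongs to partition PART[x]
G = [[1.0, 2.0],               # G[x][u]: one-step cost of action u in state x
     [2.0, 0.5],
     [3.0, 1.0],
     [0.0, 4.0]]
P = [                          # P[u][x][y]: probability of x -> y under action u
    [[0.5, 0.5, 0.0, 0.0],
     [0.0, 0.5, 0.5, 0.0],
     [0.0, 0.0, 0.5, 0.5],
     [0.5, 0.0, 0.0, 0.5]],
    [[0.0, 0.0, 1.0, 0.0],
     [1.0, 0.0, 0.0, 0.0],
     [0.0, 1.0, 0.0, 0.0],
     [0.0, 0.0, 0.0, 1.0]],
]


def fixed_point(step, size, tol=1e-10, max_iter=100_000):
    """Iterate a contraction mapping from the zero vector until it settles."""
    v = [0.0] * size
    for _ in range(max_iter):
        new = step(v)
        if max(abs(a - b) for a, b in zip(new, v)) < tol:
            return new
        v = new
    return v


def exact_values(P, G, lam):
    """Exact cost-to-go by value iteration on the full state space."""
    n, m = len(G), len(G[0])

    def step(v):
        return [min(G[x][u] + lam * sum(P[u][x][y] * v[y] for y in range(n))
                    for u in range(m))
                for x in range(n)]

    return fixed_point(step, n)


def aggregated_lower_bound(P, G, lam, part):
    """One value per partition: minimum Bellman backup over the partition's states."""
    n, m, k = len(G), len(G[0]), max(part) + 1

    def step(w):
        out = [float("inf")] * k
        for x in range(n):
            backup = min(G[x][u] + lam * sum(P[u][x][y] * w[part[y]] for y in range(n))
                         for u in range(m))
            out[part[x]] = min(out[part[x]], backup)
        return out

    return fixed_point(step, k)


V = exact_values(P, G, LAM)
w = aggregated_lower_bound(P, G, LAM, PART)
for x in range(len(V)):
    print(f"state {x}: lower bound {w[PART[x]]:.4f}, exact {V[x]:.4f}")
```

In this cost-minimization form, the fixed point `w` is the componentwise largest vector satisfying the restricted inequalities, so it matches what a linear program maximizing any positively weighted sum of the partition values would return. The minimizing state in each partition acts as a representative, which loosely mirrors the abstract's remark about a family of lower-dimensional Markov chains.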

Bibliographic details

Source
AIAA infotech@aerospace conference and exhibit | 2012 | 12 pages
Authors
K. Krishnamoorthy; S. Darbha; M. Park; M. Pachter; P. Chandler; D. Casbeer
Original format: PDF
CLC classification: Aviation instruments, aviation equipment, flight control and navigation

Similar literature

1. Several test problems on applying Wu's method to nonlinear programming problems [J]. Wu Tianjiao. 数学季刊：英文版 (Mathematics Quarterly: English Edition). 1994, No. 2
2. Lower Bounding Linear Program for the Perimeter Patrol Optimization Problem [J]. Krishnamoorthy Kalyanam, Myoungkuk Park, Swaroop Darbha. Journal of Guidance, Control, and Dynamics. 2014, No. 2
3. Stochastic linear programming with scarce information: an approach from expected utility and bounded rationality applied to the textile industry [J]. Enrique Ballestero. Engineering Optimization. 2006, No. 4
4. Asymptotic Lower Bounds for Optimal Tracking: A Linear Programming Approach [J]. Cai Jiatu, Rosenbaum Mathieu, Tankov Peter. The Annals of Applied Probability: an official journal of the Institute of Mathematical Statistics. 2017, No. 4
5. A Lower Bounding Linear Programming approach to the Perimeter Patrol Stochastic Control Problem [C]. K. Krishnamoorthy, S. Darbha, M. Park. AIAA infotech@aerospace conference and exhibit. 2012
6. Lower Bounds for Interactive Compression and Linear Programs [D]. Sinha, Makrand. 2018
7. On the linear programming bound for linear Lee codes [O]. Helena Astola, Ioan Tabus
8. A fault-tolerant modular control approach to multi-robot perimeter patrol [O]. Ro Marino, Lynne E. Parker, Gianluca Antonelli. 2010
