...
首页> 外文期刊>IEEE Transactions on Automatic Control >Two-Time Scale Controlled Markov Chains: A Decomposition and Parallel Processing Approach
【24h】

Two-Time Scale Controlled Markov Chains: A Decomposition and Parallel Processing Approach

机译:二次尺度控制的马尔可夫链:分解和并行处理方法

获取原文
获取原文并翻译 | 示例

摘要

This correspondence deals with a class of ergodic control problems for systems described by Markov chains with strong and weak interactions. These systems are composed of a set of subchains that are weakly coupled. Using results already available in the literature one formulates a limit control problem the solution of which can be obtained via an associated nondifferentiable convex programming (NDCP) problem. The technique used to solve the NDCP problem is the Analytic Center Cutting Plane Method (ACCPM) which implements a dialogue between, on one hand, a master program computing the analytical center of a localization set containing the solution and, on the other hand, an oracle proposing cutting planes that reduce the size of the localization set at each main iteration. The interesting aspect of this implementation comes from two characteristics: (i) the oracle proposes cutting planes by solving reduced sized Markov Decision Problems (MDP) via a linear program (LP) or a policy iteration method; (ii) several cutting planes can be proposed simultaneously through a parallel implementation on processors. The correspondence concentrates on these two aspects and shows, on a large scale MDP obtained from the numerical approximation ldquoa la Kushner-Dupuisrdquo of a singularly perturbed hybrid stochastic control problem, the important computational speed-up obtained.
机译:这种对应关系解决了由马尔可夫链描述的具有强相互作用和弱相互作用的系统的遍历控制问题。这些系统由一组弱耦合的子链组成。使用文献中已有的结果,可以制定一个极限控制问题,可以通过相关的不可微凸编程(NDCP)问题来解决该问题。解决NDCP问题的技术是分析中心切割平面方法(ACCPM),该方法一方面实现了计算包含解决方案的定位集的分析中心的主程序,另一方面实现了解决方案。 oracle建议使用切割平面,以减少每次主迭代时设置的本地化大小。此实现的有趣方面来自两个特征:(i)oracle通过通过线性程序(LP)或策略迭代方法解决尺寸减小的Markov决策问题(MDP)来提出切割平面的建议; (ii)通过在处理器上并行实施,可以同时提出几个切割平面。对应关系集中在这两个方面,并且从奇异摄动混合随机控制问题的数值近似“ ldquoa la Kushner-Dupuisrdquo”获得的大规模MDP上显示出所获得的重要计算速度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号