首页> 美国政府科技报告 >Randomized Search Methods for Solving Markov Decision Processes and Global Optimization

【24h】

Randomized Search Methods for Solving Markov Decision Processes and Global Optimization

机译：求解马尔可夫决策过程和全局优化的随机搜索方法

获取原文

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Markov decision process (MDP) models provide a unified framework for modeling and describing sequential decision making problems that arise in engineering, economics and computer science. However, when the underlying problem is modeled by MDPs there is a typical exponential growth in the size of the resultant MDP model with the size of the original problem, which makes practical solution of the MDP models intractable especially for large problems. Moreover, for complex systems, it is often the case that some of the parameters of the MDP models cannot be obtained in a feasible way, but only simulation samples are available. In the first part of this thesis, we develop two sampling/simulation-based numerical algorithms to address the computational difficulties arising from these settings. The proposed algorithms have somewhat different emphasis one algorithm focuses on MDPs with large state spaces but relatively small action spaces and emphasizes on the efficient allocation of simulation samples to find good value function estimates, whereas the other algorithm targets problems with large action spaces but small state spaces, and invokes a population-based approach to avoid carrying out an optimization over the entire action space. We study the convergence properties of these algorithms and report on computational results to illustrate their performance. The second part of this thesis is devoted to the development of a general framework called Model Reference Adaptive Search (MRAS) for solving global optimization problems. The method iteratively updates a parameterized probability distribution on the solution space, so that the sequence of candidate solutions generated from this distribution will converge asymptotically to the global optimum. We provide a particular instantiation of the framework and establish its convergence properties in both continuous and discrete domains.

著录项

作者
Hu, J;
展开▼
作者单位

展开▼
年度 2006
页码 1-229
总页数 229
原文格式 PDF
正文语种 eng
中图分类工业技术;
关键词
Mdp(Markov decision process);

机译：mdp（马尔可夫决策过程）;

相似文献

外文文献
中文文献
专利

1. An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes [J] . Jiaqiao Hu, Michael C. Fu, Vahid R. Ramezani, INFORMS journal on computing . 2007,第2期

机译：一种求解马尔可夫决策过程的进化随机策略搜索算法
2. Decomposition methods for solving Markov decision processes with multiple models of the parameters [J] . Lauren N. Steimle, Vinayak S. Ahluwalia, Charmee Kamdar, AIIE Transactions . 2021,第12期

机译：用多个模型解决Markov决策过程的分解方法
3. Comparison of two methods for solving Markov Decision Processes in the persecution-evasion problem [J] . Michel Garc?a, Cinhtia Gonz?lez, Enrique Succar, International journal of computer science and network security . 2010,第4期

机译：迫害回避问题中两种求解马尔可夫决策过程的方法的比较
4. A Model Reference Adaptive Search Method for Stochastic Optimization with Applications to Markov Decision Processes [C] . Jiaqiao Hu, Michael C. Fu, Steven I. Marcus IEEE Conference on Decision and Control . 2007

机译：用于马尔可夫决策过程的随机优化的模型参考自适应搜索方法
5. Randomized search methods for solving Markov decision processes and global optimization. [D] . Hu, Jiaqiao. 2006

机译：解决马尔可夫决策过程和全局优化的随机搜索方法。
6. A hybrid cuckoo search algorithm with Nelder Mead method for solving global optimization problems [O] . Ahmed F. Ali, Mohamed A. Tawhid -1

机译：Nelder Mead混合杜鹃搜索算法求解全局优化问题
7. An Evolutionary Random Policy Search Algorithm for Solving Markov Decision Processes [O] . Jiaqiao Hu, Michael C. Fu, Vahid R. Ramezani, 2007

机译：一种求解马尔可夫决策过程的进化随机策略搜索算法

Randomized Search Methods for Solving Markov Decision Processes and Global Optimization

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅