A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies1）

TANGHao; XIHong-Sheng; YINBao-Qun

首页> 中文期刊> 《自动化学报》 >A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies1）

A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies1）

开具论文收录证明 >>

期刊封面封底目录下载 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

cqvip:Based on the theory of Markov performance potentials and neuro-dynamic programming(NDP) methodology, we study simulation optimization algorithm for a class of continuous timeMarkov decision processes (CTMDPs) under randomized stationary policies. The proposed algo-rithm will estimate the gradient of average cost performance measure with respect to policy param-eters by transforming a continuous time Markov process into a uniform Markov chain and simula-ting a single sample path of the chain. The goal is to look for a suboptimal randomized stationarypolicy. The algorithm derived here can meet the needs of performance optimization of many diffi-cult systems with large-scale state space. Finally, a numerical example for a controlled Markovprocess is provided.

著录项

来源
《自动化学报》 |2004年第2期|P.229-234|共6页
作者
TANGHao; XIHong-Sheng; YINBao-Qun;
展开▼
作者单位

DepartmentofAutomation UniversityofScienceandTechnologyofChina Hefei230026;

展开▼
原文格式 PDF
正文语种 chi
中图分类计算机仿真;
关键词
仿真优化算法; 随机平稳策略; CTMDP; Markov性能势理论;

相似文献

中文文献
外文文献

1. NUMERICAL SIMULATION ALGORITHM FOR RELIABILITY ANALYSIS OF COMPLEX STRUCTURAL SYSTEM BASED ON INTELLIGENT OPTIMIZATION [J] . . 中国机械工程学报 . 2006,第001期
2. On Convergence Behaviors of Relaxation-based Algorithms in Circuit Simulation [C] . . 中国电子学会电路与系统学会第十六届年会 . 2001
3. Dynamic Optimization of E--commerce Logistics Network Based on Machine Learning Algorithms [A] . Chen Zou . 2020

A Simulation Optimization Algorithm for CTMDPs Based on Randomized Stationary Policies1）

摘要

著录项

相似文献

相关主题

期刊订阅