首页> 美国政府科技报告 >Computational Comparison of Value Iteration Algorithms for Discounted Markov Decision Processes.
【24h】

Computational Comparison of Value Iteration Algorithms for Discounted Markov Decision Processes.

机译:马尔可夫决策过程的价值迭代算法计算比较。

获取原文

摘要

This note describes the results of a computational comparison of value iteration algorithms suggested for solving finite state discounted Markov decision processes. Such a process visits a set of states S = (1,2,...M). In Section two we describe the schemes examined and the various bounds that can be used for stopping them. Section three concentrates on one scheme that did well in the comparison - ordinary value iteration - and looks at various methods for eliminating non-optimal actions both permanently and temporarily.

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号