...
首页> 外文期刊>Operations Research Letters: A Journal of the Operations Research Society of America >Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming
【24h】

Modified policy iteration algorithms are not strongly polynomial for discounted dynamic programming

机译:对于折扣动态规划,修改后的策略迭代算法不是强多项式

获取原文
获取原文并翻译 | 示例
           

摘要

This note shows that the number of arithmetic operations required by any member of a broad class of optimistic policy iteration algorithms to solve a deterministic discounted dynamic programming problem with three states and four actions may grow arbitrarily. Therefore any such algorithm is not strongly polynomial. In particular, the modified policy iteration and λ-policy iteration algorithms are not strongly polynomial.
机译:此注释显示,乐观的策略迭代算法的大类中的任何成员解决具有三个状态和四个动作的确定性折扣动态规划问题所需要的算术运算数量可以任意增加。因此,任何这样的算法都不是强多项式。特别地,修改后的策略迭代和λ策略迭代算法不是强多项式。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号