首页> 外文会议>NSF Design Manufacturing Research Conference >Nonhomogeneous Markov Decision Processes
【24h】

Nonhomogeneous Markov Decision Processes

机译:非均质马尔可夫决策过程

获取原文

摘要

We study the nonhomogeneous Markov decision process with an objective of developing theory and both optimal and heuristic algorithms capable of solving moderately sized problems. Our approach is to identify forecast horizons by exploiting problem structure. Truncating the problem at a specific horizon, we can search a set of potential salvage values to determine if we have a forecast horizon. The approach developed combines genetic algorithms to do a quick search, and a mixed integer program to verify optimality. This approach is generalized to any model of sequential decision-making for which a successive approximation step is closed with respect to the set of all piecewise affine and convex functions. Such models include many parametrized Markov decision processes (e.g., the multi-objective Markov decision process and the partially observed Markov decision process).
机译:我们研究了非均匀马尔可夫决策过程,目的是发展理论和能够解决中等大小问题的最佳和启发式算法。我们的方法是通过利用问题结构来识别预测视野。截断特定视野的问题,我们可以搜索一组潜在的救助价值,以确定我们是否有预测地平线。该方法开发结合遗传算法来进行快速搜索,以及混合整数程序以验证最优性。该方法是推广到任何连续决策模型的模型,其中相对于所有分段仿射和凸函数的集合闭合了连续的近似步骤。这些模型包括许多参数化的马尔可夫决策过程(例如,多目标马尔可夫决策过程和部分观察到的马尔可夫决策过程)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号