...
首页> 外文期刊>International Journal of Information Technology & Decision Making >A COMPROMISE PROGRAMMING APPROACH TO MULTIOBJECTIVE MARKOV DECISION PROCESSES
【24h】

A COMPROMISE PROGRAMMING APPROACH TO MULTIOBJECTIVE MARKOV DECISION PROCESSES

机译:多目标马尔可夫决策过程的一种妥协编程方法

获取原文
获取原文并翻译 | 示例

摘要

A Markov decision process (MDP) is a general model for solving planning problems under uncertainty. It has been extended to multiobjective MDP to address multicriteria or multiagent problems in which the value of a decision must be evaluated according to several viewpoints, sometimes conflicting. Although most of the studies concentrate on the determination of the set of Pareto-optimal policies, we focus here on a more specialized problem that concerns the direct determination of policies achieving well-balanced tradeoffs. To this end, we introduce a reference point method based on the optimization of a weighted ordered weighted average (WOWA) of individual disachievements. We show that the resulting notion of optimal policy does not satisfy the Bellman principle and depends on the initial state. To overcome these difficulties, we propose a solution method based on a linear programming (LP) reformulation of the problem. Finally, we illustrate the feasibility of the proposed method on two types of planning problems under uncertainty arising in navigation of an autonomous agent and in inventory management.
机译:马尔可夫决策过程(MDP)是解决不确定性下计划问题的通用模型。它已扩展到多目标MDP,以解决多准则或多主体问题,其中决策的价值必须根据几种观点进行评估,有时会相互冲突。尽管大多数研究都集中在确定一组帕累托最优政策上,但我们这里集中在一个更专业的问题上,该问题涉及直接确定实现均衡平衡的政策。为此,我们介绍了一种基于参考值的方法,该方法基于单个成就的加权有序加权平均值(WOWA)的优化。我们证明最优政策的结果概念不满足Bellman原理,而是取决于初始状态。为了克服这些困难,我们提出了一种基于线性规划(LP)重新编写问题的解决方法。最后,我们说明了在自主代理商导航和库存管理中出现不确定性的情况下,该方法对两种类型的计划问题的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号