首页> 外文会议>Annual conference on genetic and evolutionary computation >Evolving Policies for Multi-Reward Partially Observable Markov Decision Processes (MR-POMDPs)
【24h】

Evolving Policies for Multi-Reward Partially Observable Markov Decision Processes (MR-POMDPs)

机译:不断变化的多奖励部分可观察马尔可夫决策过程(MR-POMDPS)

获取原文

摘要

Plans and decisions in many real-world scenarios are made under uncertainty and to satisfy multiple, possibly conflicting, objectives. In this work, we contribute the multi-reward partially-observable Markov decision process (MR-POMDP) as a general modelling framework. To solve MR-POMDPs, we present two hybrid (memetic) multi-objective evolutionary algorithms that generate non-dominated sets of policies (in the form of stochastic finite state controllers). Performance comparisons between the methods on multi-objective problems in robotics (with 2, 3 and 5 objectives), web-advertising (with 3, 4 and 5 objectives) and infectious disease control (with 3 objectives), revealed that memetic variants outperformed their original counterparts. We anticipate that the MR-POMDP along with multi-objective evolutionary solvers will prove useful in a variety of theoretical and real-world applications.
机译:在许多现实世界方案中的计划和决策是在不确定性的不确定性和满足多重,可能相互冲突的目标中的。在这项工作中,我们为一般建模框架提供了多奖励部分可观察的马尔可夫决策过程(MR-POMDP)。为了解决MR-POMDPS,我们呈现了两个混合(麦片)多目标进化算法,产生非主导的政策组(以随机有限状态控制器的形式)。机器人中多目标问题的方法与2,3和5个目标)的性能比较,网络广告(具有3,4和5个目标)和传染病控制(具有3个目标),揭示了膜变体表现优于其原始对应物。我们预计MR-POMDP以及多目标进化求解器将在各种理论和现实世界应用中证明是有用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号