首页> 外文期刊>The Journal of Artificial Intelligence Research >A Survey of Multi-Objective Sequential Decision-Making
【24h】

A Survey of Multi-Objective Sequential Decision-Making

机译:多目标顺序决策研究

获取原文
获取原文并翻译 | 示例
           

摘要

Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This article surveys algorithms designed for sequential decision-making problems with multiple objectives. Though there is a growing body of literature on this subject, little of it makes explicit under what circumstances special methods are needed to solve multi-objective problems. Therefore, we identify three distinct scenarios in which converting such a problem to a single-objective one is impossible, infeasible, or undesirable. Furthermore, we propose a taxonomy that classifies multi-objective methods according to the applicable scenario, the nature of the scalarization function (which projects multi-objective values to scalar ones), and the type of policies considered. We show how these factors determine the nature of an optimal solution, which can be a single policy, a convex hull, or a Pareto front. Using this taxonomy, we survey the literature on multi-objective methods for planning and learning. Finally, we discuss key applications of such methods and outline opportunities for future work.
机译:具有多个目标的顺序决策问题在实践中很自然地出现,并且对决策理论规划和学习的研究提出了独特的挑战,而决策理论的规划和学习主要集中于单目标环境。本文概述了针对具有多个目标的顺序决策问题而设计的算法。尽管关于这一主题的文献越来越多,但很少有文献明确指出在什么情况下需要特殊的方法来解决多目标问题。因此,我们确定了三种不同的方案,在这些方案中,不可能,无法实现或不希望将这种问题转换为单目标的问题。此外,我们提出了一种分类法,该分类法根据适用场景,标量函数的性质(将多目标值投影到标量值)和所考虑的策略类型对多目标方法进行分类。我们将说明这些因素如何确定最佳解决方案的性质,该解决方案可以是单个策略,凸包或Pareto前沿。使用这种分类法,我们调查了有关计划和学习的多目标方法的文献。最后,我们讨论了此类方法的关键应用,并概述了未来工作的机会。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号