Conference on Uncertainty in Artificial Intelligence

Approximation of Lorenz-Optimal Solutions in Multiobjective Markov Decision Processes



Abstract

This paper is devoted to fair optimization in Multiobjective Markov Decision Processes (MOMDPs). A MOMDP is an extension of the MDP model for planning under uncertainty while trying to optimize several reward functions simultaneously. This applies to multiagent problems when rewards define individual utility functions, or in multicriteria problems when rewards refer to different features. In this setting, we study the determination of policies leading to Lorenz-non-dominated tradeoffs. Lorenz dominance is a refinement of Pareto dominance that was introduced in Social Choice for the measurement of inequalities. In this paper, we introduce methods to efficiently approximate the sets of Lorenz-non-dominated solutions of infinite-horizon, discounted MOMDPs. The approximations are polynomial-sized subsets of those solutions.
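For readers unfamiliar with the notion, the following is a minimal Python sketch of the standard Lorenz dominance test between two reward vectors, using the usual definition (sort each vector in nondecreasing order, take cumulative sums to form the generalized Lorenz vector, then compare Lorenz vectors by Pareto dominance). The function names are illustrative and are not taken from the paper.

from itertools import accumulate

def lorenz_vector(rewards):
    # Generalized Lorenz vector: cumulative sums of the components
    # sorted in nondecreasing order.
    return list(accumulate(sorted(rewards)))

def lorenz_dominates(x, y):
    # x Lorenz-dominates y if the Lorenz vector of x Pareto-dominates
    # the Lorenz vector of y: weakly larger in every component and
    # strictly larger in at least one.
    lx, ly = lorenz_vector(x), lorenz_vector(y)
    return all(a >= b for a, b in zip(lx, ly)) and any(a > b for a, b in zip(lx, ly))

# Example: (2, 8) and (5, 5) give the same total reward, but the more
# balanced tradeoff (5, 5) Lorenz-dominates (2, 8).
print(lorenz_dominates([5, 5], [2, 8]))  # True
print(lorenz_dominates([2, 8], [5, 5]))  # False

This illustrates why Lorenz dominance refines Pareto dominance: neither vector Pareto-dominates the other, yet the fairer tradeoff is preferred under Lorenz dominance.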
