A note on the structure of value spaces in vector-valued Markov decision processes

Kazuyoshi Wakuta

首页> 外文期刊>Mathematical methods of operations research >A note on the structure of value spaces in vector-valued Markov decision processes

【24h】

A note on the structure of value spaces in vector-valued Markov decision processes

机译：关于向量值马尔可夫决策过程中值空间结构的注释

获取原文

获取原文并翻译 | 示例

获取外文期刊封面封底 >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

For a vector-valued Markov Decision process with discounted reward criterion, we study the structure of its value spaces defined for all initial states. At first we discuss the relationship between the value spaces, i.e. we verify a linking property for optimality. We next show that if the values of deterministic stationary policies generate a face of the value space, any point of that face can be represented as the value of a randomization of these policies. We also examine whether the value of a randomization of deterministic stationary policies lies on the face generated by the values of these policies.

机译：对于具有折现奖励标准的向量值马尔可夫决策过程，我们研究了为所有初始状态定义的其值空间的结构。首先，我们讨论值空间之间的关系，即验证链接属性的最优性。接下来，我们表明，如果确定性平稳策略的值生成了值空间的面，则该面的任何点都可以表示为这些策略的随机值。我们还研究了确定性平稳策略的随机化值是否位于这些策略的值所产生的面孔上。

著录项

来源
《Mathematical methods of operations research》 |1999年第1期|共9页
作者
Kazuyoshi Wakuta;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类数学;
关键词
markov decision process; vector-valued reward; value spaces; randomization of policies;

机译：马可夫决策过程;向量值奖励;价值空间;政策随机化;

相似文献

外文文献
中文文献
专利

1. A note on the structure of value spaces in vector-valued Markov decision processes [J] . Kazuyoshi Wakuta Mathematical methods of operations research . 1999,第1期

机译：关于向量值马尔可夫决策过程中值空间结构的注释
2. VECTOR-VALUED MARKOV DECISION PROCESSES WITH AVERAGE REWARD CRITERION: THE MULTICHAIN CASE [J] . Kazuyoshi Wakuta 20f Probability in the Engineering and Informational Sciences . 2000,第4期

机译：具有平均奖励标准的向量值马尔可夫决策过程：多链案例
3. NEW CLASS OF POLICIES IN VECTOR-VALUED MARKOV DECISION PROCESSES [J] . Wakuta K. Journal of Mathematical Analysis and Applications . 1996,第2期

机译：价值向量马尔可夫决策过程中的新一类策略
4. The measure space structure of logical Markov decision processes [C] . Zhenzhen Wang, Hancheng Xing International Conference on Fuzzy Systems and Knowledge Discovery . 2013

机译：逻辑马尔可夫决策过程的度量空间结构
5. A New Solution for Markov Decision Processes and Its Aerospace Applications [D] . Bertram, Joshua. 2020

机译：马尔可夫决策过程及其航空应用的新解决方案
6. Entropic uncertainty relations for Markovian and non-Markovian processes under a structured bosonic reservoir [O] . Dong Wang, Ai-Jun Huang, Ross D. Hoehn, -1

机译：结构化储层下马尔可夫过程和非马尔可夫过程的熵不确定关系
7. A New Class of Policies in Vector-Valued Markov Decision Processes [O] . Wakuta Kazuyoshi 1996

机译：向量值马尔可夫决策过程中的一类新策略
8. Two Short Notes on Markov Processes: I. A Test for Sub-Optimal Actions in Markovian Decision Problems. II. An Intrinsically Determined Markov Chain [R] . MacQueen, J. B. 1966

机译：关于马尔可夫过程的两个简短说明：I。马尔可夫决策问题中次优最优行动的检验。 II。本质上确定的马尔可夫链

A note on the structure of value spaces in vector-valued Markov decision processes

摘要

著录项

相似文献

相关主题

期刊订阅