首页> 外文会议> >Certainty equivalence for imperfect information finite state-space stochastic games

【24h】

Certainty equivalence for imperfect information finite state-space stochastic games

机译：不完全信息有限状态空间随机博弈的确定性等价

获取原文

获取外文期刊封面目录资料

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Stochastic games under imperfect information are typically computationally intractable even in the discrete-time/discrete-state case considered here. We consider a problem where one player has perfect information. A function of a conditional probability distribution is proposed as an information state. In the problem form here, the payoff is only a function of the terminal state of the system, and the initial information state is either linear or a sum of max-plus delta functions. When the initial information state belongs to these classes, its propagation is finite-dimensional. The state feedback value function is also finite-dimensional, and obtained via dynamic programming, but has a nonstandard form due to the necessity of an expanded state variable. Under a saddle point assumption, certainty equivalence is obtained and the proposed function is indeed an information state.

机译：即使在此处考虑的离散时间/离散状态情况下，具有不完善信息的随机博弈通常在计算上也是棘手的。我们考虑一个玩家拥有完善信息的问题。提出了条件概率分布的函数作为信息状态。在这里的问题形式中，收益仅是系统终端状态的函数，并且初始信息状态是线性的或最大加增量函数的总和。当初始信息状态属于这些类别时，其传播是有限维的。状态反馈值函数也是有限维的，并且是通过动态编程获得的，但是由于必须扩展状态变量，因此具有非标准形式。在鞍点假设下，获得了确定性等价，并且所建议的功能确实是一种信息状态。

著录项

来源
《》|2004年|p.3467-3472|共6页
会议地点
作者
McEneaney; W.M.;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类无线电电子学、电信技术;
关键词
stochastic games; discrete time systems; statistical distributions; dynamic programming; multidimensional systems; state-space methods; imperfect information finite state-space stochastic games; computational intractability; perfect information; conditional probability distribution; information state; initial information state; max-plus delta functions; finite-dimensional propagation; dynamic programming; saddle point assumption; certainty equivalence; discrete stochastic games;

机译：随机博弈;离散时间系统;统计分布;动态规划;多维系统;状态空间方法;不完善的信息;有限状态空间的随机博弈;计算难易度;完美信息;条件概率分布;信息状态;初始信息状态;最大值德尔塔函数;有限维传播;动态规划;鞍点假设;等价性;离散随机博弈;

相似文献

外文文献
中文文献
专利

1. Some classes of imperfect information finite state-space stochastic games with finite-dimensional solutions [J] . McEneaney WM Applied mathematics and optimization . 2004,第2期

机译：几类具有有限维解的不完全信息有限状态空间随机博弈
2. Certainty equivalence principle in stochastic differential games: An inverse problem approach [J] . Josa-Fombellida Ricardo, Pablo Rincon-Zapatero Juan Optimal Control Applications and Methods . 2019,第3期

机译：随机差异游戏的确定性等价原理：逆问题方法
3. LARGE POPULATION STOCHASTIC DYNAMIC GAMES: CLOSED-LOOP MCKEAN-VLASOV SYSTEMS AND THE NASH CERTAINTY EQUIVALENCE PRINCIPLE [J] . MINYI HUANG, ROLAND P. MALHAME, PETER E. CAINES Communications in Information and Systems . 2006,第3期

机译：大型随机动态游戏：闭环MCKEAN-VLASOV系统和NASH等效性原理
4. Nash Certainty Equivalence in Large Population Stochastic Dynamic Games: Connections with the Physics of Interacting Particle Systems [C] . Minyi Huang, Malhame, R.P., . 2006

机译：大种群随机动态博弈中的纳什确定性等价性：与相互作用粒子系统的物理联系
5. Essays on Stochastic Games and on Strategic Equivalence between Normal Form Games. [D] . Polydoro, Angelo. 2011

机译：关于随机游戏和规范形式游戏之间的战略对等的论文。
6. State-space reduction and equivalence class sampling for a molecular self-assembly model [O] . Daniel M. Packwood, Patrick Han, Taro Hitosugi 2016

机译：分子自组装模型的状态空间缩减和等价类采样
7. Certainty equivalence principle in stochastic differential games: An inverse problem approach [O] . Ricardo Josa-Fombellida, Juan Pablo Rincón-Zapatero 2019

机译：随机差异游戏中确定性的等价原理：逆问题方法

Certainty equivalence for imperfect information finite state-space stochastic games

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅