首页> 外文会议>IEEE Conference on Decision and Control >Independently Randomized Symmetric Policies are Optimal for Exchangeable Stochastic Teams with Infinitely Many Decision Makers

【24h】

Independently Randomized Symmetric Policies are Optimal for Exchangeable Stochastic Teams with Infinitely Many Decision Makers

机译：独立随机的对称政策对于无限多决策者的可交换随机团队是最优的

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study stochastic team (known also as decentralized stochastic control or identical interest stochastic game) problems with large or countably infinite number of decision makers, and characterize existence and structural properties for (globally) optimal policies. We consider in particular both static and dynamic non-convex team problems where the cost function and dynamics satisfy an exchangeability condition. We first establish a de Finetti type representation theorem for exchangeable decentralized policies, that is, for the probability measures induced by admissible policies under decentralized information structures. For a general setup of stochastic team problems with N decision makers, under exchangeability of observations of decision makers and the cost function, we show that without loss of global optimality, the search for optimal policies over any convex set of probability measures on policies can be restricted to those that are N-exchangeable. Then, by extending N-exchangeable policies to infinitely exchangeable ones, establishing a convergence argument for the induced costs, and using the presented de Finetti type theorem, we establish the existence of an optimal decentralized policy for static and dynamic teams with countably infinite number of decision makers, which turns out to be symmetric (i.e., identical) and randomized. In particular, unlike prior work, convexity of the cost is not assumed.

机译：我们研究随机团队（也称为分散的随机控制或相同的兴趣随机游戏）问题，具有大型或可比无限的决策者，并表征（全球）最佳政策的存在和结构性。我们特别考虑静态和动态非凸的团队问题，其中成本函数和动态满足交换性条件。我们首先建立一个可交换的分散政策的De Finetti型表示定理，即，在分散的信息结构下受理政策引起的概率措施。对于N决策者的一般性设置，在决策者的可交换性和成本职能的可交换性下，我们展示了没有损失全球最优性的情况下，在任何凸起的政策上的任何凸起概率措施上都可以获得最佳政策限于那些是N-易换的人。然后，通过将N-易换的策略扩展到无数可交换的策略，为诱导成本建立融合参数，并使用所呈现的de Finetti型定理，我们建立了具有可选无限数量的静态和动态团队的最佳分散政策的存在决策者，结果是对称（即，相同）和随机化。特别是，与现有工作不同，没有假设成本的凸起。

著录项

来源
《IEEE Conference on Decision and Control》|2020年|5986-5991|共6页
会议地点
作者
Sina Sanjari; Naci Saldi; Serdar Yüksel;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Cost function; Games; Stochastic processes; Topology; Random variables; Q measurement; Extraterrestrial measurements;

机译：成本函数;游戏;随机过程;拓扑;随机变量;Q测量;外星测量;

相似文献

外文文献
中文文献
专利

1. Optimal Solutions to Infinite-Player Stochastic Teams and Mean-Field Teams [J] . Sanjari Sina, Yuksel Serdar IEEE Transactions on Automatic Control . 2021,第3期

机译：无限播放器随机团队和卑鄙领域的最佳解决方案
2. A Topology for Team Policies and Existence of Optimal Team Policies in Stochastic Team Theory [J] . Saldi Naci IEEE Transactions on Automatic Control . 2020,第1期

机译：随机团队理论中最优小组政策的团队政策和存在的拓扑
3. Designing evaluation studies to optimally inform policy: what factors do policy-makers in China consider when making resource allocation decisions on healthcare worker training programmes? [J] . Shishi Wu, Helena Legido-Quigley, Julia Spencer, Health Research Policy and Systems . 2018,第1期

机译：设计评价研究以最佳地通知政策：在中国政策制定者在制定关于医疗工作者培训计划的资源分配决策时考虑哪些因素？
4. Optimal Stochastic Teams with Infinitely Many Decision Makers and Mean-Field Teams [C] . Sina Sanjari, Serdar Yüksel IEEE Conference on Decision and Control . 2018

机译：具有无限多的决策者和均值团队的最优随机团队
5. Optimal control policies for stochastic networks with multiple decision makers. [D] . McInvale, Howard D. 2009

机译：具有多个决策者的随机网络的最优控制策略。
6. Designing evaluation studies to optimally inform policy: what factors do policy-makers in China consider when making resource allocation decisions on healthcare worker training programmes? [O] . Shishi Wu, Helena Legido-Quigley, Julia Spencer, 2018

机译：设计评估研究以最佳地为政策提供信息：中国的决策者在制定医护人员培训计划的资源分配决策时会考虑哪些因素？
7. Optimum configuration for distributed teams of two decision-makers [O] . 1988

机译：两个决策者的分布式团队的最佳配置

Independently Randomized Symmetric Policies are Optimal for Exchangeable Stochastic Teams with Infinitely Many Decision Makers

摘要

著录项

相似文献

相关主题

期刊订阅