Toward a classification of finite partial-monitoring games



Abstract

Partial-monitoring games constitute a mathematical framework for sequential decision-making problems with imperfect feedback: the learner repeatedly chooses an action, the opponent responds with an outcome, and then the learner suffers a loss and receives a feedback signal, both of which are fixed functions of the action and the outcome. The goal of the learner is to minimize his total cumulative loss. We make progress towards the classification of these games based on their minimax expected regret. Namely, we classify almost all games with two outcomes and a finite number of actions: we show that their minimax expected regret is either zero, $\widetilde{\Theta}(\sqrt{T})$, $\Theta(T^{2/3})$, or $\Theta(T)$, and we give a simple and efficiently computable classification of these four classes of games. Our hope is that the result can serve as a stepping stone toward classifying all finite partial-monitoring games.
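The protocol described in the abstract can be illustrated with a minimal Python sketch. This is not taken from the paper: the names `play` and `FixedActionLearner`, the example loss and feedback matrices, and the choice of an outcome sequence fixed in advance are all assumptions made for illustration. The sketch computes the regret of one run against the best fixed action in hindsight, which is the quantity whose minimax expected value the abstract classifies.

```python
class FixedActionLearner:
    """Hypothetical placeholder learner that always plays the same action."""

    def __init__(self, action):
        self.action = action

    def act(self, t):
        return self.action

    def observe(self, action, signal):
        # A real learner would update its strategy from the feedback signal.
        pass


def play(loss, feedback, learner, outcomes):
    """Run the game and return the regret against the best fixed action.

    loss[i][j] and feedback[i][j] are the fixed loss and feedback signal
    for action i and outcome j. The outcome sequence is assumed to be
    chosen in advance (an assumption of this sketch).
    """
    total = 0.0
    for t, j in enumerate(outcomes):
        i = learner.act(t)
        total += loss[i][j]                 # the loss is suffered, not revealed
        learner.observe(i, feedback[i][j])  # only the feedback signal is revealed
    best_fixed = min(sum(row[j] for j in outcomes) for row in loss)
    return total - best_fixed


# Example game (assumed): two actions, two outcomes, uninformative feedback.
loss = [[0, 1], [1, 0]]
feedback = [["a", "a"], ["a", "a"]]
regret = play(loss, feedback, FixedActionLearner(1), [0, 0, 1, 0])
print(regret)  # 2.0
```

In this example the learner always plays action 1 and suffers total loss 3, while action 0 would have suffered loss 1, so the regret is 2. Because both actions return the same signal on every outcome, no learner can tell the outcomes apart from feedback alone.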


Bibliographic record

Authors: András Antos; Gábor Bartók; Dávid Pál; Csaba Szepesvári
Year: 2013
Original format: PDF

Similar literature


1. Toward a classification of finite partial-monitoring games [J]. Andras Antos, Gabor Bartok, David Pal. Theoretical Computer Science, 2013.
2. Wheelchair Basketball Competition Heart Rate Profile According to Players' Functional Classification, Tournament Level, Game Type, Game Quarter and Playing Time [J]. Jolanta Marszałek, Karol Gryko, Andrzej Kosmol. Frontiers in Psychology, 2019, no. a.
3. Wheelchair Basketball Competition Heart Rate Profile According to Players' Functional Classification, Tournament Level, Game Type, Game Quarter and Playing Time [J]. Marszałek Jolanta, Gryko Karol, Kosmol Andrzej. Frontiers in Psychology, 2019, no. 2.
4. Toward a Classification of Finite Partial-Monitoring Games [C]. Gabor Bartok, David Pal, Csaba Szepesvari. Algorithmic Learning Theory, 2010.
5. Classification of EEG Signals of User States in Gaming Using Machine Learning [D]. Mallapragada, Chandana. 2018.
6. Wheelchair Basketball Competition Heart Rate Profile According to Players' Functional Classification, Tournament Level, Game Type, Game Quarter and Playing Time [O]. Jolanta Marszałek, Karol Gryko, Andrzej Kosmol. 2005.
7. Finite Mean Field Games: Fictitious play and convergence to a first order continuous mean field game [O]. Saeed Hadikhanloo, Francisco J. Silva. 2019.
