Recently we proposed a new exploration technique for individual reinforcement learners that helps them coordinate on the Pareto Optimal Nash equilibrium of a game. This technique, in which agents may exclude one or more actions from their action space, can be seen as a discrete version of the traditional ε-greedy exploration technique. In this paper we refine this exploration technique further with a standard technique from general search problems, namely random restarts. Due to this refinement, we are able to prove convergence to the Pareto Optimal Nash equilibrium in general stochastic common interest games. Moreover, communication becomes unnecessary. Experiments demonstrate this technique on two challenging test problems and examine its use in larger joint action spaces.
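As a rough illustration of the idea described above, the following Python sketch shows a single learner in a repeated (stateless) game that excludes low-valued actions from its action set and occasionally performs a random restart that re-admits all actions. The class name ExcludingLearner and the parameters exclude_prob and restart_prob are assumptions made for illustration only; this is not the paper's exact algorithm.

```python
import random

class ExcludingLearner:
    """Illustrative sketch: exploration via action exclusion with random restarts.

    Instead of taking a uniformly random action with probability epsilon
    (classic epsilon-greedy), the agent may drop actions from its action
    set; a random restart re-admits all actions.  All names and parameter
    values here are assumptions for illustration.
    """

    def __init__(self, n_actions, alpha=0.1, exclude_prob=0.05, restart_prob=0.01):
        self.q = [0.0] * n_actions            # value estimate per action
        self.active = set(range(n_actions))   # actions not yet excluded
        self.alpha = alpha                    # learning rate
        self.exclude_prob = exclude_prob      # chance of dropping the worst active action
        self.restart_prob = restart_prob      # chance of a random restart

    def select_action(self):
        # Random restart: re-admit every action so the search can leave a local optimum.
        if random.random() < self.restart_prob:
            self.active = set(range(len(self.q)))
        # Discrete analogue of epsilon-greedy: occasionally exclude the
        # worst-valued active action instead of randomising over all actions.
        if len(self.active) > 1 and random.random() < self.exclude_prob:
            worst = min(self.active, key=lambda a: self.q[a])
            self.active.discard(worst)
        # Act greedily over the remaining (non-excluded) actions.
        return max(self.active, key=lambda a: self.q[a])

    def update(self, action, reward):
        # Simple incremental value update for a repeated (stateless) game.
        self.q[action] += self.alpha * (reward - self.q[action])
```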