Journal of Machine Learning Research > Lenient Learning in Independent-Learner Stochastic Cooperative Games

Lenient Learning in Independent-Learner Stochastic Cooperative Games



Abstract

We introduce the Lenient Multiagent Reinforcement Learning 2 (LMRL2) algorithm for independent-learner stochastic cooperative games. LMRL2 is designed to overcome a pathology called relative overgeneralization, and to do so while still performing well in games with stochastic transitions, stochastic rewards, and miscoordination. We discuss the existing literature, then compare LMRL2 against other algorithms drawn from the literature which can be used for games of this kind: traditional ("Distributed") Q-learning, Hysteretic Q-learning, WoLF-PHC, SOoN, and (for repeated games only) FMQ. The results show that LMRL2 is very effective in both of our measures (complete and correct policies), and is found in the top rank more often than any other technique. LMRL2 is also easy to tune: though it has many available parameters, almost all of them stay at default settings. Generally the algorithm is optimally tuned with a single parameter, if any. We then examine and discuss a number of side-issues and options for LMRL2.
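To illustrate the leniency idea the abstract refers to, here is a minimal sketch of a generic lenient Q-learning update, not the paper's exact LMRL2 procedure: each state-action pair carries a temperature, and while the temperature is high the learner probabilistically ignores updates that would lower its value estimate (which is what mitigates relative overgeneralization). The function name `lenient_update` and the parameters `kappa`, `decay`, and `min_temp` are illustrative assumptions, not identifiers from the paper.

```python
import math
import random

def lenient_update(Q, s, a, target, alpha, temp,
                   kappa=2.0, decay=0.995, min_temp=0.0):
    """One lenient Q-learning step (illustrative sketch, not LMRL2 itself).

    Q:      dict mapping (state, action) -> value estimate
    temp:   dict mapping (state, action) -> temperature (leniency level)
    target: the TD target, e.g. r + gamma * max_a' Q[(s', a')]
    """
    # Leniency is the probability of IGNORING an update that would
    # lower Q. High temperature -> very lenient; as temperature cools,
    # behavior approaches ordinary Q-learning.
    leniency = 1.0 - math.exp(-kappa * temp[(s, a)])

    # Value-raising updates are always applied (optimism); value-lowering
    # updates are applied only with probability (1 - leniency).
    if target > Q[(s, a)] or random.random() > leniency:
        Q[(s, a)] += alpha * (target - Q[(s, a)])

    # Cool the temperature for this state-action pair.
    temp[(s, a)] = max(min_temp, temp[(s, a)] * decay)
```

With temperature near zero the leniency term vanishes and every update goes through, so the learner degenerates to standard Q-learning; with a large temperature, low-reward experiences (e.g. those caused by a partner's exploration) are mostly ignored rather than allowed to drag down the estimate of a jointly optimal action.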


