International Conference on Autonomous Agents and Multiagent Systems

Efficient Convention Emergence through Decoupled Reinforcement Social Learning with Teacher-Student Mechanism



Abstract

In this paper, we design reinforcement learning based (RL-based) strategies to promote convention emergence in multiagent systems (MASs) with a large convention space. We apply our approaches to a language coordination problem in which agents need to coordinate on a dominant lexicon for efficient communication. By modeling each lexicon, which maps each concept to a single word, as a Markov strategy representation, the original single-state convention learning problem can be transformed into a multi-state multiagent coordination problem. The dynamics of lexicon evolution during an interaction episode can be modeled as a Markov game, which allows agents to improve the action values of each concept separately and incrementally. Specifically, we propose two learning strategies, multiple-Q and multiple-R, and also propose incorporating a teacher-student mechanism on top of the learning strategies to accelerate lexicon convergence. Extensive experiments verify that our approaches outperform the state-of-the-art approaches in terms of convergence efficiency, convention quality, and scalability.
