Toward Collaborative Reinforcement Learning Agents that Communicate Through Text-Based Natural Language

机译：朝着通过基于文本的自然语言进行沟通的协同强化学习代理

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Communication between agents in collaborative multi-agent settings is in general implicit or a direct data stream. This paper considers text-based natural language as a novel form of communication between multiple agents trained with reinforcement learning. This could be considered first steps toward a truly autonomous communication without the need to define a limited set of instructions, and natural collaboration between humans and robots. Inspired by the game of Blind Leads, we propose an environment where one agent uses natural language instructions to guide another through a maze. We test the ability of reinforcement learning agents to effectively communicate through discrete word-level symbols and show that the agents are able to sufficiently communicate through natural language with a limited vocabulary. Although the communication is not always perfect English, the agents are still able to navigate the maze. We achieve a BLEU score of 0.85, which is an improvement of 0.61 over randomly generated sequences while maintaining a 100% maze completion rate. This is a 3.5 times the performance of the random baseline using our reference set.

机译：协作多代理设置的代理之间的通信通常是隐式的或直接数据流。本文认为基于文本的自然语言作为具有加强学习培训的多个代理商之间的新型沟通形式。这可以被认为是一个真正自主沟通的首先步骤，而无需定义一组有限的指令，以及人类和机器人之间的自然协作。受到盲目领导的游戏的启发，我们提出了一种环境，其中一个代理使用自然语言指示通过迷宫引导另一个。我们测试强化学习代理能力通过离散字级符号有效地通信，并表明代理能够通过具有有限词汇的自然语言充分地通信。虽然沟通并不总是完美的英语，但代理商仍然能够导航迷宫。我们达到了0.85的BLEU得分，这是随机生成的序列超过0.61的改善，同时保持100％迷宫完成率。这是随机基线使用我们的参考集的性能的3.5倍。

著录项

来源
《Southern African Universities Power Engineering Conference;Conference on Robotics and Mechatronics;Conference on Pattern Recognition Association of South Africa》|2021年|1-6|共6页
会议地点
作者
Kevin M. Eloff; Herman A. Engelbrecht;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Vocabulary; Navigation; Natural languages; Collaboration; Reinforcement learning; Receivers; Robots;

机译：词汇;导航;自然语言;合作;加强学习;接收器;机器人;

相似文献

外文文献
中文文献
专利

1. 运用自然语言处理对证素辨证学进行文本挖掘研究 [J] . 邓文祥, 朱建平, 李静, 数字中医药(英文) . 2019,第002期
2. 运用自然语言处理对证素辨证学进行文本挖掘研究 [J] . 邓文祥, 朱建平, 李静, 数字中医药（英文） . 2019,第002期
3. Learning by Communicating in Natural Language With Conversational Agents [J] . Graesser Arthur C., Li Haiying, Forsyth Carol Current directions in psychological science: a journal of the American Psychological Society . 2014,第5期

机译：通过与会话代理进行自然语言交流来学习
4. Learning how, what, and whether to communicate: emergence of protocommunication in reinforcement learning agents [J] . Takashi Sato, Eiji Uchibe, Kenji Doya Artificial life and robotics . 2008,第1a2期

机译：学习如何，什么以及是否进行交流：强化学习主体中原始交流的出现
5. A unified framework for reinforcement learning, co-learning and meta-learning how to coordinate in collaborative multi-agent systems [J] . Predrag T. To?i?, Ricardo Vilalta Procedia Computer Science . 2010,第1期

机译：强化学习，共同学习和元学习的统一框架，如何在协作式多智能体系统中进行协调
6. Language Understanding for Text-based Games using Deep Reinforcement Learning [C] . Karthik Narasimhan, Tejas D Kulkarni, Regina Barzilay Conference on empirical methods in natural language processing . 2015

机译：使用深度强化学习的基于文本的游戏的语言理解
7. A study of collaborative distributed intelligent multi-agent reinforcement learning via multi goals for dynamic agent shortest path-planning [D] . Kim, Minsuk. 2016

机译：通过多目标进行动态代理最短路径规划的协同分布式智能多功能智能多功能多智能智能多功能
8. Novel Use of Natural Language Processing (NLP) to Predict Suicidal Ideation and Psychiatric Symptoms in a Text-Based Mental Health Intervention in Madrid [O] . Benjamin L. Cook, Ana M. Progovac, Pei Chen, 2016

机译：在马德里基于文本的心理健康干预中使用自然语言处理（NLP）预测自杀意念和精神症状的新颖方法
9. Language Understanding for Text-based Games using Deep Reinforcement Learning [O] . Narasimhan Karthik Rajagopal, Kulkarni Tejas Dattatraya, Barzilay Regina 2015

机译：使用深度强化学习对基于文本的游戏进行语言理解

Toward Collaborative Reinforcement Learning Agents that Communicate Through Text-Based Natural Language

摘要

著录项

相似文献

相关主题

期刊订阅