Designing intelligent agents that represent mobile robots in a complex environment is challenging, especially if the agents are to cooperate in a multiagent context and learn from experience. Depending on the specifications of the robots, the application requirements, and the type of environment, the design procedure differs from one application to another. Regardless of the application, the most important step in designing an intelligent agent is identifying its architecture, which captures the capabilities of the agent and reflects the nature of the environment. Once the architecture is designed, the agent's modules and subsystems can be implemented according to the specifications.

In this thesis, the main application is robotic soccer in the small-robot category, with omniscient knowledge of the environment and without physical inter-robot communication. To make this application multiagent, with all the necessary properties such as autonomy, concurrency, and the ability to communicate, an agent-oriented simulator was created to control the small robots.

The agent architecture has two layers, a reactive layer and a coordination layer, to accommodate rapid changes in the environment while supporting coordination with other agents. The reactive layer is equipped with a repository of behaviours that make a direct connection from perception to action for fast reaction to new situations. The coordination layer arbitrates among the local behaviours to choose the best one for the current state of the environment, and also coordinates the agent's local decisions with those of other agents to achieve cooperation and avoid conflicts. Since the agent is controlled entirely by individual behaviours, its performance depends entirely on the behaviour design.
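The two-layer structure described above can be illustrated with a minimal sketch. All names here (the `State` fields, the behaviours `go_to_ball` and `defend_goal`, and the toy applicability rule) are assumptions for illustration, not the thesis's actual design:

```python
from dataclasses import dataclass
from typing import Callable, Dict

# Hypothetical world state: ball and robot positions on the field.
@dataclass(frozen=True)
class State:
    ball_x: float
    ball_y: float
    robot_x: float
    robot_y: float

# A behaviour maps perception directly to an action, as in the reactive layer.
Behaviour = Callable[[State], str]

def go_to_ball(s: State) -> str:
    return f"move_towards({s.ball_x}, {s.ball_y})"

def defend_goal(s: State) -> str:
    return "move_towards(own_goal)"

class CoordinationLayer:
    """Arbitrates among local behaviours based on the current state."""
    def __init__(self, behaviours: Dict[str, Behaviour]):
        self.behaviours = behaviours

    def applicability(self, name: str, s: State) -> float:
        # Toy arbitration rule (assumed): defend when the ball is in our half.
        if name == "defend_goal":
            return 1.0 if s.ball_x < 0 else 0.0
        return 0.5

    def act(self, s: State) -> str:
        # Pick the behaviour with the highest applicability and run it.
        best = max(self.behaviours, key=lambda n: self.applicability(n, s))
        return self.behaviours[best](s)

agent = CoordinationLayer({"go_to_ball": go_to_ball, "defend_goal": defend_goal})
print(agent.act(State(ball_x=-1.0, ball_y=0.0, robot_x=0.0, robot_y=0.0)))
# → move_towards(own_goal)
```

The point of the split is visible even in this toy version: the behaviours themselves stay simple, stateless perception-to-action mappings, while all situation-dependent selection logic lives in the coordination layer.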
In this thesis, a methodology based on human experience is proposed and tested to facilitate the behaviour design procedure.

Learning from experience, that is, improving the quality of actions incrementally, is addressed by learning how to select behaviours in different situations. Although the individual behaviours are not adaptable, the arbitration among them is. The environment provides reward and/or punishment, and the agent learns to choose the behaviours that bring reward and avoid those that lead to punishment. Different reinforcement learning techniques are examined for this purpose. As a necessary part of reinforcement learning, and to deal with the continuous state variables in this application, a function approximation technique called the Adaptive Fuzzy (AF) technique is introduced and compared with other existing techniques.

All the agents, with the hybrid architecture, equipped with properly designed behaviours, having an adaptable arbitration unit, and able to communicate with each other, were put together to play a full game. Simulations showed cooperative social behaviour and demonstrated that the agents were capable of learning from experience.
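The idea of learning the arbitration rather than the behaviours can be sketched with standard tabular Q-learning, where the "actions" are whole behaviours. This is a generic illustration under assumed names (the behaviour list and state labels are hypothetical), not the thesis's actual method; in particular, the continuous-state function approximation (the AF technique) is omitted here:

```python
import random

# Behaviours are the discrete "actions" the arbitration unit selects among.
BEHAVIOURS = ["go_to_ball", "defend_goal", "pass_ball"]  # names assumed

class BehaviourArbiter:
    def __init__(self, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.q = {}  # (state, behaviour) -> estimated long-term value
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def select(self, state):
        # Epsilon-greedy: usually exploit the highest-valued behaviour.
        if random.random() < self.epsilon:
            return random.choice(BEHAVIOURS)
        return max(BEHAVIOURS, key=lambda b: self.q.get((state, b), 0.0))

    def update(self, state, behaviour, reward, next_state):
        # One-step Q-learning backup toward reward + discounted next value.
        best_next = max(self.q.get((next_state, b), 0.0) for b in BEHAVIOURS)
        old = self.q.get((state, behaviour), 0.0)
        self.q[(state, behaviour)] = old + self.alpha * (
            reward + self.gamma * best_next - old)

# After repeated reward for defending and punishment for chasing the ball
# in the same situation, the arbiter prefers the rewarded behaviour.
arbiter = BehaviourArbiter(epsilon=0.0)
for _ in range(50):
    arbiter.update("ball_in_own_half", "defend_goal", 1.0, "ball_in_own_half")
    arbiter.update("ball_in_own_half", "go_to_ball", -1.0, "ball_in_own_half")
print(arbiter.select("ball_in_own_half"))  # → defend_goal
```

The behaviours themselves never change; only the value estimates that drive selection do, which matches the abstract's point that the arbitration, not the individual behaviours, is what adapts.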