A hybrid agent architecture integrating desire, intention and reinforcement learning

Ah-Hwee Tan; Yew-Soon Ong; Akejariyawong Tapanuj

首页> 外文期刊>Expert Systems with Application >A hybrid agent architecture integrating desire, intention and reinforcement learning

【24h】

A hybrid agent architecture integrating desire, intention and reinforcement learning

机译：混合了欲望，意图和强化学习的混合主体架构

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This paper presents a hybrid agent architecture that integrates the behaviours of BDI agents, specifically desire and intention, with a neural network based reinforcement learner known as Temporal Difference-Fusion Architecture for Learning and COgNition (TD-FALCON). With the explicit maintenance of goals, the agent performs reinforcement learning with the awareness of its objectives instead of relying on external reinforcement signals. More importantly, the intention module equips the hybrid architecture with deliberative planning capabilities, enabling the agent to purposefully maintain an agenda of actions to perform and reducing the need of constantly sensing the environment. Through reinforcement learning, plans can also be learned and evaluated without the rigidity of user-defined plans as used in traditional BDI systems. For intention and reinforcement learning to work cooperatively, two strategies are presented for combining the intention module and the reactive learning module for decision making in a real time environment. Our case study based on a minefield navigation domain investigates how the desire and intention modules may cooperatively enhance the capability of a pure reinforcement learner. The empirical results show that the hybrid architecture is able to learn plans efficiently and tap both intentional and reactive action execution to yield a robust performance.

机译：本文提出了一种混合式智能体体系结构，该体系结构将BDI智能体的行为（特别是欲望和意图）与基于神经网络的强化学习器集成在一起，称为学习和认知的时间差异融合体系结构（TD-FALCON）。通过明确维护目标，代理可以在意识到目标的情况下进行强化学习，而不必依赖外部强化信号。更重要的是，意图模块为混合体系结构提供了计划性的计划功能，使代理能够有目的地维护要执行的动作的议程，并减少了不断感知环境的需求。通过强化学习，还可以学习和评估计划，而无需像传统的BDI系统中那样使用用户定义的计划。为了使意图学习和强化学习能够协同工作，提出了两种策略，用于将意图模块和反应性学习模块相结合，以便在实时环境中进行决策。我们基于雷区导航域的案例研究调查了愿望和意图模块如何协作增强纯强化学习者的能力。实证结果表明，混合体系结构能够有效地学习计划，并有意和有反应地执行动作以产生强大的性能。

著录项

来源
《Expert Systems with Application》 |2011年第7期|p.8477-8487|共11页
作者
Ah-Hwee Tan; Yew-Soon Ong; Akejariyawong Tapanuj;
展开▼
作者单位

School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore;

School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore;

School of Computer Engineering, Nanyang Technological University, Nanyang Avenue, Singapore 639798, Singapore;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
bdi architecture; reinforcement learning; plan learning; self-organizing neural networks; minefield navigation;

机译：bdi体系结构;强化学习;计划学习;自组织神经网络;雷区导航;

相似文献

外文文献
中文文献
专利

1. A self-organizing neural architecture integrating desire, intention and reinforcement learning [J] . Ah-Hwee Tan, Yu-Hong Feng, Yew-Soon Ong Neurocomputing . 2010,第7a9期

机译：一个自组织的神经体系结构，融合了愿望，意图和强化学习
2. Leveraging the Beliefs-Desires-Intentions Agent Architecture [J] . Arnaldo Perez Castano MSDN Magazine . 2019,第1期

机译：利用信念，愿望，意图代理架构
3. A Belief–Desire–Intention Multi-agent Architecture for Efficient Power Plant Disturbance Analysis [J] . Jonas Pesente, Gustavo Herbig, Miguel Moreto, Journal of control, automation and electrical systems . 2018,第3期

机译：一种用于高效电厂扰动分析的信念 - 欲望 - 意图多智能经纪架构
4. Integration of Immune Features into a Belief-Desire-Intention Model for Multi-agent Control of Public Transportation Systems [C] . Salima Mnif, Saber Darmoul, Sabeur Elkosantini, International conference on hybrid artificial intelligent systems . 2017

机译：将免疫功能集成到信念-愿望-意图模型中，以实现公共交通系统的多主体控制
5. Use of Reinforcement Learning (RL) for plan generation in Belief-Desire-Intention (BDI) agent systems [D] . Feliu, Jose L. 2013

机译：使用增强学习（RL）在信念-愿望-意图（BDI）代理系统中生成计划
6. Connecting Social Psychology and Deep Reinforcement Learning: A Probabilistic Predictor on the Intention to Do Home-Based Physical Activity After Message Exposure [O] . Patrizia Catellani, Valentina Carfora, Marco Piastra 2021

机译：连接社会心理学和深度加固学习：概率预测因素在消息曝光后做基于家庭的身体活动的意图
7. Beliefs, Obligations, Intentions and Desires as Components in an Agent Architecture [O] . Broersen, J.M., Dastani, M.M., van der Torre, L. 2005

机译：作为代理体系结构中的组件的信念，义务，意图和愿望
8. Developing Concept Learning Capabilities in the COGNET/IGEN Integrative architecture and Associated Agent-based Modeling and Behavioral Representation (AMBR) Air Traffic Control [R] . Zachary, W. , Ryder, J. , Stokes, J. , 2004

机译：在COGNET / IGEN综合架构和基于关联代理的建模和行为表示（amBR）空中交通管制中开发概念学习能力

A hybrid agent architecture integrating desire, intention and reinforcement learning

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅