首页> 外文期刊>Expert Systems with Application >A hybrid agent architecture integrating desire, intention and reinforcement learning
【24h】

A hybrid agent architecture integrating desire, intention and reinforcement learning

机译:混合了欲望,意图和强化学习的混合主体架构

获取原文
获取原文并翻译 | 示例

摘要

This paper presents a hybrid agent architecture that integrates the behaviours of BDI agents, specifically desire and intention, with a neural network based reinforcement learner known as Temporal Difference-Fusion Architecture for Learning and COgNition (TD-FALCON). With the explicit maintenance of goals, the agent performs reinforcement learning with the awareness of its objectives instead of relying on external reinforcement signals. More importantly, the intention module equips the hybrid architecture with deliberative planning capabilities, enabling the agent to purposefully maintain an agenda of actions to perform and reducing the need of constantly sensing the environment. Through reinforcement learning, plans can also be learned and evaluated without the rigidity of user-defined plans as used in traditional BDI systems. For intention and reinforcement learning to work cooperatively, two strategies are presented for combining the intention module and the reactive learning module for decision making in a real time environment. Our case study based on a minefield navigation domain investigates how the desire and intention modules may cooperatively enhance the capability of a pure reinforcement learner. The empirical results show that the hybrid architecture is able to learn plans efficiently and tap both intentional and reactive action execution to yield a robust performance.
机译:本文提出了一种混合式智能体体系结构,该体系结构将BDI智能体的行为(特别是欲望和意图)与基于神经网络的强化学习器集成在一起,称为学习和认知的时间差异融合体系结构(TD-FALCON)。通过明确维护目标,代理可以在意识到目标的情况下进行强化学习,而不必依赖外部强化信号。更重要的是,意图模块为混合体系结构提供了计划性的计划功能,使代理能够有目的地维护要执行的动作的议程,并减少了不断感知环境的需求。通过强化学习,还可以学习和评估计划,而无需像传统的BDI系统中那样使用用户定义的计划。为了使意图学习和强化学习能够协同工作,提出了两种策略,用于将意图模块和反应性学习模块相结合,以便在实时环境中进行决策。我们基于雷区导航域的案例研究调查了愿望和意图模块如何协作增强纯强化学习者的能力。实证结果表明,混合体系结构能够有效地学习计划,并有意和有反应地执行动作以产生强大的性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号