首页> 外文期刊>Neurocomputing >A self-organizing developmental cognitive architecture with interactive reinforcement learning
【24h】

A self-organizing developmental cognitive architecture with interactive reinforcement learning

机译:具有互动强化学习的自组织发展认知架构

获取原文
获取原文并翻译 | 示例
           

摘要

Developmental cognitive systems can endow robots with the abilities to incrementally learn knowledge and autonomously adapt to complex environments. Conventional cognitive methods often acquire knowledge through passive perception, such as observing and listening. However, this learning way may generate incorrect representations inevitably and cannot correct them online without any feedback. To tackle this problem, we propose a biologically-inspired hierarchical cognitive system called Self-Organizing Developmental Cognitive Architecture with Interactive Reinforcement Learning (SODCA-IRL). The architecture introduces interactive reinforcement learning into hierarchical self-organizing incremental neural networks to simultaneously learn object concepts and fine-tune the learned knowledge by interacting with humans. In order to realize the integration, we equip individual neural networks with a memory model, which is designed as an exponential function controlled by two forgetting factors to simulate the consolidation and forgetting processes of humans. Besides, an interactive reinforcement strategy is designed to provide appropriate rewards and execute mistake correction. The feedback acts on the forgetting factors to reinforce or weaken the memory of neurons. Therefore, correct knowledge is preserved while incorrect representations are forgotten. Experimental results show that the proposed method can make effective use of the feedback from humans to improve the learning effectiveness significantly and reduce the model redundancy. (c) 2019 Elsevier B.V. All rights reserved.
机译:发展性认知系统可以赋予机器人逐步学习知识并自动适应复杂环境的能力。常规的认知方法通常通过被动的感知(例如观察和听)来获取知识。然而,这种学习方式可能不可避免地产生不正确的表示,并且在没有任何反馈的情况下无法在线纠正它们。为了解决这个问题,我们提出了一种具有生物学启发的分层认知系统,称为“具有交互式强化学习的自组织发展认知架构”(SODCA-IRL)。该体系结构将交互式强化学习引入到分层自组织的增量神经网络中,以同时学习对象概念并通过与人类交互来微调所学知识。为了实现整合,我们为单个神经网络配备了一个记忆模型,该记忆模型被设计为由两个遗忘因素控制的指数函数,以模拟人类的巩固和遗忘过程。此外,还设计了一种交互式强化策略,以提供适当的奖励并执行错误纠正。反馈作用于遗忘因素,以增强或减弱神经元的记忆。因此,保留了正确的知识,而忘记了错误的表示。实验结果表明,该方法可以有效利用人类的反馈,显着提高学习效果,减少模型冗余。 (c)2019 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号