首页> 外文OA文献 >Learning epistemic actions in model-free memory-free reinforcement learning: experiments with a neuro-robotic model
【2h】

Learning epistemic actions in model-free memory-free reinforcement learning: experiments with a neuro-robotic model

机译:在无模型的无记忆强化学习中学习认知行为:神经机器人模型的实验

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Passive sensory processing is often insufficient to guide biological organisms in complex environments. Rather, behaviourally relevant information can be accessed by performing so-called epistemic actions that explicitly aim at unveiling hidden information. However, it is still unclear how an autonomous agent can learn epistemic actions and how it can use them adaptively. In this work, we propose a definition of epistemic actions for POMDPs that derive from their characterizations in cognitive science and classical planning literature. We give theoretical insights about how partial observability and epistemic actions can affct the learning process and performance in the extreme conditions of model-free and memory-free reinforcement learning where hidden information cannot be represented. We finally investigate these concepts using an integrated eye-arm neural architecture for robot control, which can use its effctors to execute epistemic actions and can exploit the actively gathered information to effiently accomplish a seek-and-reach task.
机译:被动感觉处理通常不足以指导复杂环境中的生物。相反,可以通过执行明确旨在揭示隐藏信息的所谓认知行为来访问与行为相关的信息。但是,仍然不清楚一个自主主体如何学习认知行为以及如何适应性地使用它们。在这项工作中,我们提出了对POMDP的认知行为的定义,该定义源于它们在认知科学和古典计划文学中的表征。我们提供了关于部分可观察性和认知行为如何在无法表示隐藏信息的无模型和无记忆强化学习的极端条件下如何影响学习过程和性能的理论见解。最后,我们使用用于机器人控制的集成眼臂神经体系结构来研究这些概念,该体系结构可以使用其执行器执行认知行为,并可以利用主动收集的信息有效地完成“寻找并到达”任务。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号