【24h】

Rule value reinforcement learning for cognitive agents

机译:认知主体的规则价值强化学习

获取原文

摘要

RVRL (Rule Value Reinforcement Learning) is a new algorithm which extends an existing learning framework that models the environment of a situated agent using a probabilistic rule representation. The algorithm attaches values to learned rules by adapting reinforcement learning. Structure captured by the rules is used to form a policy. The resulting rule values represent the utility of taking an action if the rule's conditions are present in the agent's current percept. Advantages of the new framework are demonstrated, through examples in a predator-prey environment.
机译:RVRL(规则价值强化学习)是一种新算法,它扩展了现有的学习框架,该框架使用概率规则表示来对坐席环境进行建模。该算法通过适应强化学习,将值附加到学习的规则上。规则捕获的结构用于形成策略。如果代理的当前感知中存在规则的条件,则所得规则值表示采取操作的效用。通过在“捕食者-被捕食者”环境中的示例展示了新框架的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号