首页> 外文会议>International Conference Devoted to the Anniversary of Alexander Popov >CONCEPT EXTRACTION USING TEMPORAL-DIFFERENCE NETWORK EUROCON2009
【24h】

CONCEPT EXTRACTION USING TEMPORAL-DIFFERENCE NETWORK EUROCON2009

机译:使用时间差异网络欧洲欧洲欧洲差异网络提取概念提取

获取原文

摘要

In this paper, we propose a novel framework to extract temporally extended concepts in a grid world environment using a probable data structure named temporal-difference network. First a reinforcement-learning agent tries to learn its environment for the task of wall following. After that we train a newly introduced temporal-difference network (TDN) in the brain of the agent in order to gain a predictive model of the environment. At last the most promising sequences of action-observation of the given environment will be sorted out based on their probability.
机译:在本文中,我们提出了一种新颖的框架,用于使用名为时间差网络的可能数据结构在网格世界环境中提取时间扩展概念。首先,钢筋学习代理试图为墙壁的任务学习其环境。之后,我们在代理的大脑中训练新引进的时间差网络(TDN),以获得环境的预测模型。最后,将根据其概率对给定环境进行排序的最有前途的行动观察序列。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号