IEEE Symposium on Computational Intelligence and Games

Temporal difference learning with interpolated table value functions

Abstract

This paper introduces a novel function approximation architecture especially well suited to temporal difference learning. The architecture is based on sets of interpolated table look-up functions. These offer rapid and stable learning, and are efficient when the number of inputs is small. An empirical investigation tests their performance on a supervised learning task and on the mountain car problem, a standard reinforcement learning benchmark. In each case, the interpolated table functions offer competitive performance.
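
As an illustration of the idea, the following is a minimal sketch of one interpolated table used as a value function with a TD(0) update. The abstract does not give the construction, so everything here is an assumption rather than the paper's method: a single table instead of a set of tables, two inputs (as in mountain car, with position and velocity), bilinear interpolation, and the hypothetical names `InterpolatedTable2D`, `td0_step`, `n`, `alpha`, and `gamma`.

```python
from typing import List, Sequence, Tuple


class InterpolatedTable2D:
    """Values stored on a regular 2-D grid and read with bilinear interpolation."""

    def __init__(self, lows: Sequence[float], highs: Sequence[float], n: int) -> None:
        self.lows = list(lows)
        self.highs = list(highs)
        self.n = n  # grid points per axis (assumed parameter, must be >= 2)
        self.table: List[List[float]] = [[0.0] * n for _ in range(n)]

    def _corners(self, x: Sequence[float]) -> List[Tuple[int, int, float]]:
        """Return the four surrounding grid nodes with their interpolation weights."""
        idx, frac = [], []
        for d in range(2):
            # Map the input to grid coordinates in [0, n - 1], clamping at the edges.
            u = (x[d] - self.lows[d]) / (self.highs[d] - self.lows[d]) * (self.n - 1)
            u = min(max(u, 0.0), float(self.n - 1))
            i = min(int(u), self.n - 2)
            idx.append(i)
            frac.append(u - i)
        (i, j), (a, b) = idx, frac
        return [
            (i, j, (1 - a) * (1 - b)),
            (i + 1, j, a * (1 - b)),
            (i, j + 1, (1 - a) * b),
            (i + 1, j + 1, a * b),
        ]

    def value(self, x: Sequence[float]) -> float:
        """Interpolated value: weighted sum of the four surrounding table entries."""
        return sum(w * self.table[i][j] for i, j, w in self._corners(x))

    def update(self, x: Sequence[float], target: float, alpha: float) -> float:
        """Move the four surrounding entries toward target, scaled by their weights."""
        error = target - self.value(x)
        for i, j, w in self._corners(x):
            self.table[i][j] += alpha * error * w
        return error


def td0_step(v: InterpolatedTable2D, s: Sequence[float], reward: float,
             s_next: Sequence[float], done: bool,
             gamma: float = 1.0, alpha: float = 0.5) -> float:
    """One TD(0) update of v at state s; returns the TD error."""
    target = reward if done else reward + gamma * v.value(s_next)
    return v.update(s, target, alpha)
```

Because the output is linear in the table entries, the update is a plain gradient step in which each query touches only four entries. The table holds n^2 entries for two inputs and grows exponentially with the input dimension, which is consistent with the abstract's remark that the approach is efficient when the number of inputs is small.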