首页> 外文学位 >Feature Selection for Value Function Approximation.
【24h】

Feature Selection for Value Function Approximation.

机译:值函数逼近的特征选择。

获取原文
获取原文并翻译 | 示例

摘要

The field of reinforcement learning concerns the question of automated action selection given past experiences. As an agent moves through the state space, it must recognize which state choices are best in terms of allowing it to reach its goal. This is quantified with value functions, which evaluate a state and return the sum of rewards the agent can expect to receive from that state. Given a good value function, the agent can choose the actions which maximize this sum of rewards. Value functions are often chosen from a linear space defined by a set of features; this method offers a concise structure, low computational effort, and resistance to overfitting. However, because the number of features is small, this method depends heavily on these few features being expressive and useful, making the selection of these features a core problem. This document discusses this selection.;Aside from a review of the field, contributions include a new understanding of the role approximate models play in value function approximation, leading to new methods for analyzing feature sets in an intuitive way, both using the linear and the related kernelized approximation architectures. Additionally, we present a new method for automatically choosing features during value function approximation which has a bounded approximation error and produces superior policies, even in extremely noisy domains.
机译:强化学习领域涉及到根据过去的经验进行自动动作选择的问题。当一个代理在状态空间中移动时,它必须认识到哪个状态选择在使其达到目标方面是最好的。这可以通过价值函数来量化,价值函数可以评估状态并返回代理商可以期望从该状态获得的报酬总额。给定良好的价值功能,代理可以选择使报酬总额最大化的行动。值函数通常是从一组特征定义的线性空间中选择的;该方法提供了简洁的结构,较低的计算工作量以及抗过度拟合的能力。但是,由于特征的数量很少,因此该方法在很大程度上取决于这几个具有表达性和实用性的特征,这使得选择这些特征成为一个核心问题。本文档讨论了这种选择。除了对本领域的回顾外,贡献还包括对近似模型在价值函数逼近中的作用的新理解,从而提出了一种使用线性和线性方法直观地分析特征集的新方法。相关的核化近似架构。此外,我们提出了一种在值函数逼近过程中自动选择特征的新方法,该方法具有有限的逼近误差,即使在嘈杂的域中也能产生出众的策略。

著录项

  • 作者

    Taylor, Gavin.;

  • 作者单位

    Duke University.;

  • 授予单位 Duke University.;
  • 学科 Artificial Intelligence.;Computer Science.
  • 学位 Ph.D.
  • 年度 2011
  • 页码 113 p.
  • 总页数 113
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号