Feature Selection for Value Function Approximation.

机译：值函数逼近的特征选择。

获取原文

获取原文并翻译 | 示例

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The field of reinforcement learning concerns the question of automated action selection given past experiences. As an agent moves through the state space, it must recognize which state choices are best in terms of allowing it to reach its goal. This is quantified with value functions, which evaluate a state and return the sum of rewards the agent can expect to receive from that state. Given a good value function, the agent can choose the actions which maximize this sum of rewards. Value functions are often chosen from a linear space defined by a set of features; this method offers a concise structure, low computational effort, and resistance to overfitting. However, because the number of features is small, this method depends heavily on these few features being expressive and useful, making the selection of these features a core problem. This document discusses this selection.;Aside from a review of the field, contributions include a new understanding of the role approximate models play in value function approximation, leading to new methods for analyzing feature sets in an intuitive way, both using the linear and the related kernelized approximation architectures. Additionally, we present a new method for automatically choosing features during value function approximation which has a bounded approximation error and produces superior policies, even in extremely noisy domains.

机译：强化学习领域涉及到根据过去的经验进行自动动作选择的问题。当一个代理在状态空间中移动时，它必须认识到哪个状态选择在使其达到目标方面是最好的。这可以通过价值函数来量化，价值函数可以评估状态并返回代理商可以期望从该状态获得的报酬总额。给定良好的价值功能，代理可以选择使报酬总额最大化的行动。值函数通常是从一组特征定义的线性空间中选择的；该方法提供了简洁的结构，较低的计算工作量以及抗过度拟合的能力。但是，由于特征的数量很少，因此该方法在很大程度上取决于这几个具有表达性和实用性的特征，这使得选择这些特征成为一个核心问题。本文档讨论了这种选择。除了对本领域的回顾外，贡献还包括对近似模型在价值函数逼近中的作用的新理解，从而提出了一种使用线性和线性方法直观地分析特征集的新方法。相关的核化近似架构。此外，我们提出了一种在值函数逼近过程中自动选择特征的新方法，该方法具有有限的逼近误差，即使在嘈杂的域中也能产生出众的策略。

著录项

作者
Taylor, Gavin.;
展开▼
作者单位

Duke University.;

展开▼
授予单位 Duke University.;
学科 Artificial Intelligence.;Computer Science.
学位 Ph.D.
年度 2011
页码 113 p.
总页数 113
原文格式 PDF
正文语种 eng
中图分类
关键词

相似文献

外文文献
中文文献
专利

1. Feature selection via binary simultaneous perturbation stochastic approximation. [J] . Aksakalli Vural, Malekipirbazari Milad Pattern recognition letters . 2016,第maya1期

机译：通过二元同时扰动随机逼近进行特征选择。
2. Analytical integral equation theory for a restricted primitive model of polyelectrolytes and counterions within the mean spherical approximation. II.Radial distribution functions [J] . N.von Solms, Y.C.Chiew The Journal of Chemical Physics . 2003,第9期

机译：平均球面近似范围内的聚电解质和抗衡离子的受限原始模型的解析积分方程理论。二，径向分布函数
3. Density Functional Theory without the Born-Oppenheimer Approximation. II. Green Function Techniques [J] . Y. Shigeta, H. Nagao, K. Nishikawa, International Journal of Quantum Chemistry . 1999,第4a5期

机译：没有Born-Oppenheimer近似的密度泛函理论。二。绿色功能技术
4. Modeling fiber-like conductivity structures via the boundary element method using thin-wire approximation. I construction of basis functions [C] . Annual International Conference of the IEEE Engineering in Medicine and Biology Society . 2016

机译：通过使用细线近似的边界元方法对纤维状电导率结构进行建模。一，基础功能的建构
5. Unsupervised data mining methods for functional data analysis and feature selection. [D] . Rattakorn, Panaya. 2009

机译：用于功能数据分析和特征选择的无监督数据挖掘方法。
6. The blessing of Dimensionality: Feature Selection outperforms functional connectivity-based feature transformation to classify ADHD subjects from EEG patterns of phase synchronisation [O] . Ernesto Pereda, Miguel García-Torres, Belén Melián-Batista, 2012

机译：维度的祝福：特征选择优于基于功能连通性的特征转换，可从相位同步的EEG模式对ADHD主题进行分类
7. Towards relativistic quantum chemistry. On the AB INITO calculation of relativistic electron wave functions for molecules in the hartree-fock-dirac approximation. [O] . Aerts Patrick Johan Coenraad 1986

机译：走向相对论量子化学。关于hartree-fock-dirac近似中分子的相对论电子波函数的AB INITIO计算。

Feature Selection for Value Function Approximation.

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅