
Protecting against evaluation overfitting in empirical reinforcement learning


Abstract

Empirical evaluations play an important role in machine learning. However, the usefulness of any evaluation depends on the empirical methodology employed. Designing good empirical methodologies is difficult in part because agents can overfit test evaluations and thereby obtain misleadingly high scores. We argue that reinforcement learning is particularly vulnerable to environment overfitting and propose, as a remedy, generalized methodologies, in which evaluations are based on multiple environments sampled from a distribution. In addition, we consider how to summarize performance when scores from different environments may not have commensurate values. Finally, we present proof-of-concept results demonstrating how these methodologies can validate an intuitively useful range-adaptive tile coding method.
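
The abstract makes two methodological points: evaluate over many environments sampled from a distribution, and summarize performance when per-environment scores are not directly comparable. The Python sketch below illustrates that workflow only; every name in it (the toy sample_environment distribution, the two agents, and the rank-based mean_rank aggregator) is a hypothetical stand-in, not code or terminology from the paper.

    import random

    def sample_environment(rng):
        # Hypothetical environment distribution: each environment is a
        # target the agent must approximate, with a per-environment reward
        # scale, so raw scores across environments are not commensurate.
        return {"target": rng.uniform(-1.0, 1.0), "scale": rng.choice([1.0, 100.0])}

    def fixed_agent(env):
        # Always answers 0; return is the (scaled) negative error.
        return -abs(env["target"]) * env["scale"]

    def adaptive_agent(env):
        # Answers 0.9 * target, so it is strictly closer whenever target != 0.
        return -0.1 * abs(env["target"]) * env["scale"]

    def mean_rank(scores_by_agent):
        # Summarize by average per-environment rank (1 = best), so each
        # environment contributes equally regardless of its reward scale.
        agents = list(scores_by_agent)
        n_envs = len(next(iter(scores_by_agent.values())))
        ranks = {a: 0.0 for a in agents}
        for i in range(n_envs):
            ordered = sorted(agents, key=lambda a: scores_by_agent[a][i], reverse=True)
            for r, a in enumerate(ordered, start=1):
                ranks[a] += r / n_envs
        return ranks

    rng = random.Random(0)
    envs = [sample_environment(rng) for _ in range(50)]  # fresh sample per evaluation
    scores = {name: [agent(e) for e in envs]
              for name, agent in [("fixed", fixed_agent), ("adaptive", adaptive_agent)]}
    print(mean_rank(scores))  # expect adaptive near rank 1.0, fixed near 2.0

Because ranks are computed within each environment, the environment with reward scale 100.0 cannot dominate the summary the way it would under a raw mean of returns; a lower mean rank indicates better performance across the sampled distribution.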
