Empirical evaluation methods for multiobjective reinforcement learning algorithms

Peter Vamplew; Richard Dazeley; Adam Berry; Rustam Issabekov; Evan Dekker

Abstract

While a number of algorithms for multiobjective reinforcement learning have been proposed, and a small number of applications developed, there has been very little rigorous empirical evaluation of the performance and limitations of these algorithms. This paper proposes standard methods for such empirical evaluation, to act as a foundation for future comparative studies. Two classes of multiobjective reinforcement learning algorithms are identified, and appropriate evaluation metrics and methodologies are proposed for each class. A suite of benchmark problems with known Pareto fronts is described, and future extensions and implementations of this benchmark suite are discussed. The utility of the proposed evaluation methods is demonstrated via an empirical comparison of two example learning algorithms.

Bibliographic details

Source
Machine Learning, 2011, Issue 2, pp. 51-80 (30 pages)
Authors
Peter Vamplew; Richard Dazeley; Adam Berry; Rustam Issabekov; Evan Dekker
Author affiliations

Graduate School of Information Technology and Mathematical Sciences, University of Ballarat, P.O. Box 663, Ballarat, Victoria, 3353, Australia

Graduate School of Information Technology and Mathematical Sciences, University of Ballarat, P.O. Box 663, Ballarat, Victoria, 3353, Australia

CSIRO Energy Centre, 10 Murray Dwyer Circuit, Steel River Estate, Mayfield West, New South Wales, 2304, Australia

Graduate School of Information Technology and Mathematical Sciences, University of Ballarat, P.O. Box 663, Ballarat, Victoria, 3353, Australia

Graduate School of Information Technology and Mathematical Sciences, University of Ballarat, P.O. Box 663, Ballarat, Victoria, 3353, Australia

Original format: PDF
Language: English
Keywords
multiobjective reinforcement learning; multiple objectives; empirical methods; Pareto fronts; Pareto optimal policies

Similar literature

1. Performance evaluation of evolutionary multiobjective optimization algorithms for multiobjective fuzzy genetics-based machine learning [J]. Hisao Ishibuchi, Yusuke Nakashima, Yusuke Nojima. Soft Computing - A Fusion of Foundations, Methodologies and Applications. 2011, Issue 12

2. Performance evaluation of evolutionary multiobjective optimization algorithms for multiobjective fuzzy genetics-based machine learning [J]. Ishibuchi H., Nakashima Y., Nojima Y. Soft Computing: A Fusion of Foundations, Methodologies and Applications. 2011, Issue 12

3. Q-Managed: A new algorithm for a multiobjective reinforcement learning [J]. Oliveira Thiago Henrique Freire de, Medeiros Luiz Paulo de Souza, Neto Adriao Duarte Doria. Expert Systems with Applications. 2021, Issue Apra

4. An Empirical Comparison of Two Common Multiobjective Reinforcement Learning Algorithms [C]. Rustam Issabekov, Peter Vamplew. Australasian Joint Conference on Artificial Intelligence. 2012

5. Improved empirical methods in reinforcement-learning evaluation [D]. Marivate, Vukosi N. 2015

6. Myocardial infarction evaluation from stopping time decision toward interoperable algorithmic states in reinforcement learning [O]. Jong-Rul Park, Sung Phil Chung, Sung Yeon Hwang. 2020

7. Empirical evaluation methods for multiobjective reinforcement learning algorithms [O]. Peter Vamplew, Richard Dazeley, Adam Berry. 2010
