...
首页> 外文期刊>Machine Learning >Empirical evaluation methods for multiobjective reinforcement learning algorithms
【24h】

Empirical evaluation methods for multiobjective reinforcement learning algorithms

机译:多目标强化学习算法的经验评估方法

获取原文
获取原文并翻译 | 示例

摘要

While a number of algorithms for multiobjective reinforcement learning have been proposed, and a small number of applications developed, there has been very little rigorous empirical evaluation of the performance and limitations of these algorithms. This paper proposes standard methods for such empirical evaluation, to act as a foundation for future comparative studies. Two classes of multiobjective reinforcement learning algorithms are identified, and appropriate evaluation metrics and methodologies are proposed for each class. A suite of benchmark problems with known Pareto fronts is described, and future extensions and implementations of this benchmark suite are discussed. The utility of the proposed evaluation methods are demonstrated via an empirical comparison of two example learning algorithms.
机译:虽然已经提出了许多用于多目标强化学习的算法,并且开发了少量应用程序,但对这些算法的性能和局限性进行的严格经验评估很少。本文提出了这种经验评估的标准方法,以作为未来比较研究的基础。确定了两类多目标强化学习算法,并为每类提出了适当的评估指标和方法。描述了具有已知Pareto前沿的一组基准测试问题,并讨论了该基准测试套件的未来扩展和实现。通过对两个示例学习算法的经验比较证明了所提出的评估方法的实用性。

著录项

  • 来源
    《Machine Learning》 |2011年第2期|p.51-80|共30页
  • 作者单位

    Graduate School of Information Technology and Mathematical Sciences, University of Ballarat,P.O. Box 663, Ballarat, Victoria, 3353 Australia;

    Graduate School of Information Technology and Mathematical Sciences, University of Ballarat,P.O. Box 663, Ballarat, Victoria, 3353 Australia;

    CSIRO Energy Centre, 10 Murray Dwyer Circuit, Steel River Estate, Mayfield West, New South Wales,2304, Australia;

    Graduate School of Information Technology and Mathematical Sciences, University of Ballarat,P.O. Box 663, Ballarat, Victoria, 3353 Australia;

    Graduate School of Information Technology and Mathematical Sciences, University of Ballarat,P.O. Box 663, Ballarat, Victoria, 3353 Australia;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    multiobjective reinforcement learning; multiple objectives; empirical methods; pareto fronts; pareto optimal policies;

    机译:多目标强化学习;多目标;经验方法;相较前沿;相较于最优策略;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号