首页> 外文期刊>Machine Learning >Hypervolume indicator and dominance reward based multi-objective Monte-Carlo Tree Search
【24h】

Hypervolume indicator and dominance reward based multi-objective Monte-Carlo Tree Search

机译:基于超量指标和优势奖励的多目标蒙特卡洛树搜索

获取原文
获取原文并翻译 | 示例
       

摘要

Concerned with multi-objective reinforcement learning (MORL), this paper presents MOMCTS, an extension of Monte-Carlo Tree Search to multi-objective sequential decision making, embedding two decision rules respectively based on the hypervolume indicator and the Pareto dominance reward. The MOMCTS approaches are firstly compared with the MORL state of the art on two artificial problems, the two-objective Deep Sea Treasure problem and the three-objective Resource Gathering problem. The scalability of MOMCTS is also examined in the context of the NP-hard grid scheduling problem, showing that the MOMCTS performance matches the (non-RL based) state of the art albeit with a higher computational cost.
机译:关于多目标强化学习(MORL),本文提出了MOMCTS,它是蒙特卡罗树搜索对多目标顺序决策的扩展,它分别基于超量指标和帕累托优势奖励嵌入了两个决策规则。首先将MOMCTS方法与MORL技术在两个人为问题上进行了比较,这两个问题是两目标的深海寻宝问题和三目标的资源收集问题。还在NP硬网格调度问题的背景下检查了MOMCTS的可伸缩性,这表明MOMCTS性能与(基于非RL的)现有技术相匹配,尽管计算成本较高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号