首页> 外文会议>Conference on uncertainty in artificial intelligence >Monte-Carlo Planning: Theoretically Fast Convergence Meets Practical Efficiency
【24h】

Monte-Carlo Planning: Theoretically Fast Convergence Meets Practical Efficiency

机译:蒙特卡洛规划:理论上的快速收敛满足实际效率

获取原文
获取外文期刊封面目录资料

摘要

Popular Monte-Carlo tree search (MCTS) algorithms for online planning, such as ε-greedy tree search and UCT, aim at rapidly identifying a reasonably good action, but provide rather poor worst-case guarantees on performance improvement over time. In contrast, a recently introduced MCTS algorithm BRUE guarantees exponential-rate improvement over time, yet it is not geared towards identifying reasonably good choices right at the go. We take a stand on the individual strengths of these two classes of algorithms, and show how they can be effectively connected. We then rationalize a principle of "selective tree expansion", and suggest a concrete implementation of this principle within MCTS. The resulting algorithms favorably compete with other MCTS algorithms under short planning times, while preserving the attractive convergence properties of BRUE.
机译:流行的用于在线计划的蒙特卡洛树搜索(MCTS)算法,例如ε-贪心树搜索和UCT,旨在快速识别合理的好动作,但会随着时间的推移为性能改善提供最差的最坏情况保证。相比之下,最近推出的MCTS算法BRUE可以保证随着时间的推移指数速率的提高,但它并不适合于随时随地识别合理的好选择。我们站在这两类算法的各自优势上,展示了如何有效地连接它们。然后,我们合理化“选择性树扩展”的原理,并建议在MCTS中对该原理的具体实现。所得算法在较短的计划时间内即可与其他MCTS算法竞争,同时保留了BRUE的吸引人的收敛特性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号