Monte-Carlo Planning: Theoretically Fast Convergence Meets Practical Efficiency

Abstract

Popular Monte-Carlo tree search (MCTS) algorithms for online planning, such as ε-greedy tree search and UCT, aim at rapidly identifying a reasonably good action, but provide rather poor worst-case guarantees on performance improvement over time. In contrast, a recently introduced MCTS algorithm BRUE guarantees exponential-rate improvement over time, yet it is not geared towards identifying reasonably good choices right at the go. We take a stand on the individual strengths of these two classes of algorithms, and show how they can be effectively connected. We then rationalize a principle of "selective tree expansion", and suggest a concrete implementation of this principle within MCTS. The resulting algorithms favorably compete with other MCTS algorithms under short planning times, while preserving the attractive convergence properties of BRUE.

Bibliographic details

Source: Conference on Uncertainty in Artificial Intelligence, 2013, pp. 212-221 (10 pages)
Authors: Zohar Feldman; Carmel Domshlak
Original format: PDF
