Automatic discretization of actions and states in Monte-Carlo tree search

Abstract

While Monte Carlo Tree Search (MCTS) represented a revolution in game-related AI research, it is currently unfit for tasks that deal with continuous actions and (often as a consequence) continuous game states. Recent applications of MCTS to quasi-continuous games such as no-limit Poker variants have circumvented this problem by discretizing the action or state space. We present Tree Learning Search (TLS) as an alternative to a priori discretization. TLS employs ideas from data stream mining to combine incremental tree induction with MCTS, constructing game-state-dependent discretizations that allow MCTS to focus its sampling more efficiently on regions of the search space with promising returns. We evaluate TLS on global function optimization problems to illustrate its potential and show results from an early implementation in a full-scale no-limit Texas Hold'em Poker bot.
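
The idea of refining a discretization only where sampling pays off can be illustrated with a small sketch. This is not the authors' TLS implementation: it collapses the search to a single decision level, maximizes a one-dimensional function on an interval, and replaces incremental tree induction with a plain midpoint split after a fixed number of samples. All names (`Leaf`, `ucb_select`, `adaptive_search`) and parameters (`split_after`, `min_width`, `c`) are hypothetical and chosen for illustration only.

```python
import math
import random


class Leaf:
    """One interval [lo, hi] of the continuous action range plus its statistics."""

    def __init__(self, lo, hi):
        self.lo, self.hi = lo, hi
        self.visits = 0
        self.total = 0.0

    def mean(self):
        return self.total / self.visits if self.visits else 0.0


def ucb_select(leaves, c=1.0):
    """Return the leaf with the highest UCB1 score; unvisited leaves go first."""
    n = sum(leaf.visits for leaf in leaves)
    best, best_score = None, -math.inf
    for leaf in leaves:
        if leaf.visits == 0:
            return leaf
        score = leaf.mean() + c * math.sqrt(math.log(n) / leaf.visits)
        if score > best_score:
            best, best_score = leaf, score
    return best


def adaptive_search(f, lo=0.0, hi=1.0, iterations=2000,
                    split_after=20, min_width=1e-3, seed=0):
    """Sample f on [lo, hi], splitting intervals that attract many samples.

    Assumption: a leaf is split at its midpoint once it has `split_after`
    samples and is wider than `min_width`. TLS itself uses split tests from
    incremental tree induction, which this sketch does not reproduce.
    """
    rng = random.Random(seed)
    leaves = [Leaf(lo, hi)]
    for _ in range(iterations):
        leaf = ucb_select(leaves)
        x = rng.uniform(leaf.lo, leaf.hi)  # sample a continuous action
        leaf.visits += 1
        leaf.total += f(x)                 # reward is assumed to lie in [0, 1]
        if leaf.visits >= split_after and leaf.hi - leaf.lo > min_width:
            mid = (leaf.lo + leaf.hi) / 2.0
            leaves.remove(leaf)
            leaves.extend([Leaf(leaf.lo, mid), Leaf(mid, leaf.hi)])
    return leaves


if __name__ == "__main__":
    # Hypothetical test function with its maximum at x = 0.7.
    result = adaptive_search(lambda x: 1.0 - (x - 0.7) ** 2)
    for leaf in sorted(result, key=lambda l: l.lo):
        print(f"[{leaf.lo:.4f}, {leaf.hi:.4f}] visits={leaf.visits}")
```

Because a leaf is split only after it has been selected often, intervals that the bandit rule keeps choosing become narrower while rarely chosen intervals stay coarse. Discarding a leaf's statistics at split time is a simplification made here for brevity.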

Bibliographic details

Authors: Van den Broeck, Guy; Driessens, Kurt
Year: 2011
Original format: PDF
Language: English

