首页> 外文OA文献 >Automatic discretization of actions and states in Monte-Carlo tree search
【2h】

Automatic discretization of actions and states in Monte-Carlo tree search

机译:蒙特卡洛树搜索中动作和状态的自动离散化

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。
获取外文期刊封面目录资料

摘要

While Monte Carlo Tree Search (MCTS) represented a revolution in game related AI research, it is currently unfit for tasks that deal with continuous actions and (often as a consequence) game-states. Recent applications of MCTS to quasi continuous games such as no-limit Poker variants have circumvented this problem by discretizing the action or the state-space. We present Tree Learning Search (TLS) as an alternative to a priori discretization. TLS employs ideas from data stream mining to combine incremental tree induction with MCTS to construct game-state-dependent discretizations that allow MCTS to focus its sampling spread more efficiently on regions of the search space with promising returns. We evaluate TLS on global function optimization problems to illustrate its potential and show results from an early implementation on a full scale no-limit Texas Hold'em Poker bot.
机译:蒙特卡洛树搜索(MCTS)代表了与游戏相关的AI研究的一场革命,但目前不适合用于处理连续动作和(通常是)游戏状态的任务。 MCTS在准连续游戏(例如无限注扑克变体)中的最新应用通过使动作或状态空间离散化来解决此问题。我们提出树学习搜索(TLS)作为先验离散化的替代方法。 TLS利用数据流挖掘中的思想,将增量树归纳法与MCTS相结合,以构造依赖于游戏状态的离散化,从而使MCTS可以将其采样分布更有效地集中在具有可观回报的搜索空间区域。我们对全局功能优化问题上的TLS进行了评估,以说明其潜力,并展示了在不限规模的德州扑克扑克机器人中早期实施的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号