首页> 外文期刊>電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing >An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning
【24h】

An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning

机译:基于强化学习的对手特征自适应策略模型

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

In order to create a robot brain having intelligent action strategies, we proposed a model for making strategy for winning a game. During a game, It can make several strategies, and adaptively select/switch them to opponent feature change. For strategy making algorithm, Q-PSP reinforced learning are used because of faster learning speed. Selection and switching of the formed strategies are done based on the similarity between two kinds of Q-functions: (1) Q{sub}x is obtained at each strategy learning, and (2) Q{sub}m is used to recognize features of an opponent. We made a simulation program for an air hockey game based on the proposed strategy model. As the results of simulation, we confirmed the operations of strategy making and selection/switching, and evaluate the effectiveness of the proposed model.
机译:为了创建具有智能行动策略的机器人大脑,我们提出了一种制定胜利奖励游戏的模型。 在游戏期间,它可以进行多种策略,并自适应地选择/切换到对手功能改变。 对于策略制作算法,由于学习速度更快地使用Q-PSP增强学习。 选择和切换所形成的策略是基于两种Q函数之间的相似性完成:(1)在每个策略学习中获得的Q {Sub} x,并且(2)Q {Sub} M用于识别功能 对手。 我们为基于所提出的策略模型进行了潮流游戏进行了仿真程序。 作为仿真结果,我们确认了战略制作和选择/切换的操作,并评估了所提出的模型的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号