An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning

Masahiro Ono; Mitsuru Shiozaki; Mamoru SasakiAtsushi Iwata

首页> 外文期刊>電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing >An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning

【24h】

An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning

机译：An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

In order to create a robot brain having intelligent action strategies, we proposed a model for making strategy for winning a game. During a game, It can make several strategies, and adaptively select/switch them to opponent feature change. For strategy making algorithm, Q-PSP reinforced learning are used because of faster learning speed. Selection and switching of the formed strategies are done based on the similarity between two kinds of Q-functions: (1) Q{sub}x is obtained at each strategy learning, and (2) Q{sub}m is used to recognize features of an opponent. We made a simulation program for an air hockey game based on the proposed strategy model. As the results of simulation, we confirmed the operations of strategy making and selection/switching, and evaluate the effectiveness of the proposed model.

著录项

来源
《電子情報通信学会技術研究報告. ニュ-ロコンピュ-ティング. Neurocomputing》 |2003年第228期|61-66|共6页
作者
Masahiro Ono; Mitsuru Shiozaki; Mamoru SasakiAtsushi Iwata;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种日语
中图分类人工智能理论;
关键词
Brain of robot; Strategy model; Reinforcement learning; Q-function; Strategy making; Strategy selecting/switching;

An Adaptive Strategy Model for Opponent's Characteristics based on Reinforcement Learning

摘要

著录项

相关主题

期刊订阅