Journal of Advanced Computational Intelligence and Intelligent Informatics

Strategy Acquisition for Games Based on Simplified Reinforcement Learning Using a Strategy Network

Abstract

We propose a simplified form of reinforcement learning (RL) for game strategy acquisition using a strategy network. RL has been applied to a number of games, such as backgammon and checkers. However, applying RL to games with very large state spaces, such as Othello or Shogi, is more difficult because learning takes a very long time. The proposed strategy network connects N nodes on the game board to a single evaluation node through N weighted links, forming a 2-layer perceptron. These nodes denote all possible states of every square on the game board and can easily represent the evaluation function. Moreover, they can also denote imaginary states, such as pieces that may exist on the next move, the positional relation of any two pieces, or various other board phases. After several thousand games had been played, the strategy network acquired a better evaluation function more quickly than a normalized Gaussian network. A computer player employing the strategy network beat a heuristic-based player that evaluates the values of pieces and positions on the game board. The proposed strategy network was able to acquire good weightings for various features of game states. In addition, a player employing the strategy network acquired a winning strategy for a 4×4 Othello task after co-evolutionary training.
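The abstract describes the strategy network as a 2-layer perceptron whose input nodes encode the state of every square and whose single output node gives the position evaluation. The following is a minimal sketch of such an evaluator, assuming a 4×4 Othello board with one-hot square features (empty/black/white) and a TD-style update toward game outcomes; the names (StrategyNetwork, td_update), the sigmoid output, and the learning-rate value are illustrative assumptions, not details taken from the paper.

```python
import numpy as np

N_SQUARES = 16        # 4x4 Othello board
N_STATES = 3          # empty=0, black=1, white=2
N_FEATURES = N_SQUARES * N_STATES

def features(board):
    """One-hot encode the board (length-16 array of 0/1/2) into 48 inputs,
    one node per possible state of each square."""
    x = np.zeros(N_FEATURES)
    for sq, state in enumerate(board):
        x[sq * N_STATES + state] = 1.0
    return x

class StrategyNetwork:
    """2-layer perceptron: feature nodes wired by weighted links to a
    single evaluation node (sigmoid output in [0, 1])."""

    def __init__(self, lr=0.01, seed=0):
        rng = np.random.default_rng(seed)
        self.w = rng.normal(scale=0.1, size=N_FEATURES)
        self.lr = lr

    def evaluate(self, board):
        return 1.0 / (1.0 + np.exp(-self.w @ features(board)))

    def td_update(self, board, target):
        """Simplified TD-style update pulling the evaluation of `board`
        toward `target` (the next position's value, or the game outcome
        1.0/0.0 at the terminal position)."""
        x = features(board)
        v = self.evaluate(board)
        # Gradient of the squared error through the sigmoid output node.
        self.w += self.lr * (target - v) * v * (1.0 - v) * x

# Usage sketch: after each self-play game, sweep the recorded positions
# backward and pull each evaluation toward the final result.
if __name__ == "__main__":
    net = StrategyNetwork()
    game_positions = [np.random.randint(0, 3, N_SQUARES) for _ in range(10)]
    result = 1.0  # black won (illustrative)
    for board in reversed(game_positions):
        net.td_update(board, result)
        result = net.evaluate(board)  # bootstrap toward earlier positions
```

Representing each square-state pair as its own input node is what lets a single linear layer act as a weighted sum of board features, which is consistent with the abstract's claim that the network "can easily represent the evaluation function" and learn weightings of various features.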
