首页> 外文会议>International Joint Conference on Neural Networks >Strategy Selection in Complex Game Environments Based on Transfer Reinforcement Learning
【24h】

Strategy Selection in Complex Game Environments Based on Transfer Reinforcement Learning

机译:基于转移强化学习的复杂游戏环境策略选择

获取原文

摘要

Boosting the learning process in the new task by making use of previously obtained knowledge has been a challenging task in many fields of industrial engineering and scientific. In this paper, we propose a transfer reinforcement learning model with knowledge Inheritance and decision-making Assistance (trIA). In the stage of knowledge inheritance, trIA adopts a model that employs a simultaneous multi-task and multi-instance learning strategy to compress acquired experts knowledge from distinct task into a global multi-task agent. In the stage of decision-making assistance, trIA adopts a dual-column progressive neural network framework to fully utilize the previous knowledge in the global multi-task agent and the acquired knowledge in the new task. The experimental results on the Atari domain demonstrate that the proposed knowledge inheritance model can performed at nearly the same level as the experts on the distinct source task environments. The results also demonstrate that the decision-making assistance model can transfer knowledge from the source tasks to the target tasks effectively. Moreover, the comparative results with the state-ofthe-art algorithms validate the effectiveness of the proposed trIA for strategy selection in complex game environments.
机译:在工业工程和科学的许多领域中,通过利用先前获得的知识来促进新任务中的学习过程一直是具有挑战性的任务。在本文中,我们提出了一种具有知识继承和决策协助(trIA)的转移强化学习模型。在知识继承阶段,trIA采用一种模型,该模型采用同时执行的多任务和多实例学习策略,将获得的专家知识从不同的任务压缩为全局多任务代理。在决策协助阶段,trIA采用双列渐进神经网络框架来充分利用全局多任务代理中的先前知识和新任务中获得的知识。在Atari域上的实验结果表明,所提出的知识继承模型可以在与不同源任务环境的专家几乎相同的水平上执行。结果还表明,决策辅助模型可以有效地将知识从源任务转移到目标任务。此外,最新算法的比较结果验证了所提出的trIA在复杂游戏环境中进行策略选择的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号