首页> 外文会议>International Conference on Artificial Intelligence and Soft Computing >Study of Learning Ability in Profit Sharing Using Convolutional Neural Network
【24h】

Study of Learning Ability in Profit Sharing Using Convolutional Neural Network

机译:卷积神经网络的利润共享学习能力研究

获取原文

摘要

Profit Sharing using Convolutional Neural Network (PS-CNN) has been proposed as a method of deep reinforcement learning. In the previous work, experiments have been conducted using Atari 2600's Asterix in the Profit Sharing using Convolutional Neural Networks, and it is known that a better score can be obtained than Deep Q-Network. However, experiments have not been conducted on games other than Asterix, and sufficient consideration has not been made. In this paper, we report on the results of studying learning ability for some Atari 2600' games in Profit Sharing using Convolution Neural Network. By comparing the results with the results in Deep Q-Network, we confirmed that this method can acquire higher score than the Deep Q-Network in some games. The common feature of these games is that the number of actions and the number of states are relatively large.
机译:已经提出使用卷积神经网络(PS-CNN)进行利润共享作为深度强化学习的一种方法。在先前的工作中,已经使用Atari 2600的Asterix在使用卷积神经网络进行的利润共享中进行了实验,并且已知可以获得比Deep Q-Network更好的分数。但是,除Asterix以外的游戏均未进行实验,因此未作充分考虑。在本文中,我们报告了使用卷积神经网络研究某些Atari 2600游戏在获利共享中学习能力的结果。通过将结果与Deep Q-Network中的结果进行比较,我们确认该方法在某些游戏中可以获得比Deep Q-Network更高的分数。这些游戏的共同特征是动作数量和状态数量相对较大。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号