首页> 外文会议>Joint International Conference on Artificial Neural Networks and Neural Information Processing - ICANN/ICONIP 2003 Jun 26-29, 2003 Istanbul, Turkey >Acceleration of Game Learning with Prediction-Based Reinforcement Learning -Toward the Emergence of Planning Behavior
【24h】

Acceleration of Game Learning with Prediction-Based Reinforcement Learning -Toward the Emergence of Planning Behavior

机译:借助基于预测的强化学习来加速游戏学习-走向计划行为

获取原文
获取原文并翻译 | 示例

摘要

When humans solve a problem, it is unlikely that they use only the current state of the problem to decide upon an action. It is difficult to explain the human action decision strategy by means of the state to action model, which is the major method used in conventional reinforcement learning (RL). On the contrary, humans appear to predict a future state through the use of past experience and decide upon an action based on that predicted state. In this paper, we propose a prediction-based RL model (PRLmodel). In the PRL model, a state prediction module and an action memory module are added to an actor-critic type RL, and the system predicts and evaluates a future state from a current one based on an expected value table. Then, the system chooses a point of action decision in order to perform the appropriate action. To evaluate the proposed model, we perform a computer simulation using a simple ping pong game. We also discuss the possibility that the PRL model may represent an evolutional change in conventional RL as well as a step toward modeling of hmuan planning behavior, because state prediction and its evaluation are the basic elements of planning in symbolic AI.
机译:当人类解决问题时,他们不太可能仅使用问题的当前状态来决定某个动作。很难通过状态到行为模型来解释人类行为决策策略,这是常规强化学习(RL)中使用的主要方法。相反,人类似乎可以通过使用过去的经验来预测未来的状态,并根据该预测的状态来决定某个动作。在本文中,我们提出了一种基于预测的RL模型(PRLmodel)。在PRL模型中,将状态预测模块和动作存储模块添加到行为者批判类型RL,并且系统根据期望值表从当前状态预测并评估未来状态。然后,系统选择一个动作点决策以执行适当的动作。为了评估提出的模型,我们使用一个简单的乒乓球游戏进行了计算机仿真。我们还讨论了PRL模型可能代表传统RL的演变变化以及向hmuan规划行为建模的步骤的可能性,因为状态预测及其评估是符号AI中规划的基本要素。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号