首页> 外文会议>International Joint Conference on Neural Networks >A Reinforcement Learning Method for Continuous Domains Using Artificial Hydrocarbon Networks
【24h】

A Reinforcement Learning Method for Continuous Domains Using Artificial Hydrocarbon Networks

机译:基于人工烃网络的连续域强化学习方法

获取原文

摘要

Reinforcement learning in continuous states and actions has been limitedly studied in ocassions given difficulties in the determination of the transition function, lack of performance in continuous-to-discrete relaxation problems, among others. For instance, real-world problems, e.g. robotics, require these methods for learning complex tasks. Thus, in this paper, we propose a method for reinforcement learning with continuous states and actions using a model-based approach learned with artificial hydrocarbon networks (AHN). The proposed method considers modeling the dynamics of the continuous task with the supervised AHN method. Initial random rollouts and posterior data collection from policy evaluation improve the training of the AHN-based dynamics model. Preliminary results over the well-known mountain car task showed that artificial hydrocarbon networks can contribute to model-based approaches in continuous RL problems in both estimation efficiency (0.0012 in root mean squared-error) and sub-optimal policy convergence (reached in 357 steps), in just 5 trials over a parameter space θ ∈ R86. Data from experimental results are available at: http://sites.google.com/up.edu.mx/reinforcement-learning/http://sites.google.com/up.edu.mx/reinforcement-learning/.
机译:由于在确定过渡函数时遇到困难,在连续到离散的松弛问题中缺乏表现等原因,在连续状态和动作中的强化学习受到了有限的研究。例如,现实世界中的问题,例如机器人技术需要这些方法来学习复杂的任务。因此,在本文中,我们提出了一种通过使用基于人工碳氢化合物网络(AHN)的基于模型的方法对连续状态和动作进行强化学习的方法。提出的方法考虑使用监督的AHN方法对连续任务的动力学进行建模。初始随机部署和从策略评估中收集后验数据可改善基于AHN的动力学模型的训练。一项著名的山地车任务的初步结果表明,人工碳氢化合物网络可以在连续估计的效率(均方根误差为0.0012)和次优策略收敛(达到357个步骤)方面为连续RL问题中基于模型的方法做出贡献),仅需对参数空间θ∈R进行5次试验 86 。实验结果的数据可从以下网址获得:http://sites.google.com/up.edu.mx/reinforcement-learning/http://sites.google.com/up.edu.mx/reinforcement-learning/。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号