首页> 外文会议>International Conference on Machine Learning and Applications >Ensembles of Neural Networks for Robust Reinforcement Learning
【24h】

Ensembles of Neural Networks for Robust Reinforcement Learning

机译:神经网络巩固强大的强化学习的集合

获取原文

摘要

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, their training and the validation of final policies can be cumbersome as neural networks can suffer from problems like local minima or over fitting. When using iterative methods, such as neural fitted Q-iteration, the problem becomes even more pronounced since the network has to be trained multiple times and the training process in one iteration builds on the network trained in the previous iteration. Therefore errors can accumulate. In this paper we propose to use ensembles of networks to make the learning process more robust and produce near-optimal policies more reliably. We name various ways of combining single networks to an ensemble that results in a final ensemble policy and show the potential of the approach using a benchmark application. Our experiments indicate that majority voting is superior to Q-averaging and using heterogeneous ensembles (different network topologies) is advisable.
机译:使用神经网络作为函数近似器的强化学习算法已被证明是解决最佳控制问题的强大工具。然而,他们的培训和最终政策的验证可能是麻烦的,因为神经网络可能遭受局部最小值等问题或过度拟合。使用迭代方法(例如神经拟合Q迭代)时,问题变得更加明显,因为网络必须多次培训,并且在一个迭代中的培训过程构建在先前迭代中培训的网络上构建。因此错误可以累积。在本文中,我们建议使用网络的集合来使学习过程更加强大,更可靠地生产近最佳策略。我们命名各种方式,将单网组合到导致最终集合策略的集合,并使用基准应用显示方法的潜力。我们的实验表明,大多数投票优于Q平均值,并使用异构集合(不同的网络拓扑)是可取的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号