Ensembles of Neural Networks for Robust Reinforcement Learning

Abstract

Reinforcement learning algorithms that employ neural networks as function approximators have proven to be powerful tools for solving optimal control problems. However, training them and validating the final policies can be cumbersome, as neural networks can suffer from problems such as local minima or overfitting. With iterative methods such as neural fitted Q-iteration, the problem becomes even more pronounced, since the network has to be trained multiple times and the training process in one iteration builds on the network trained in the previous iteration. Errors can therefore accumulate. In this paper we propose to use ensembles of networks to make the learning process more robust and to produce near-optimal policies more reliably. We name various ways of combining single networks into an ensemble that yields a final ensemble policy and show the potential of the approach using a benchmark application. Our experiments indicate that majority voting is superior to Q-averaging and that using heterogeneous ensembles (different network topologies) is advisable.
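
The two combination rules named in the abstract, Q-averaging and majority voting, can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a discrete action set and that each ensemble member has already been evaluated to give one Q-value per action in the current state. The function names, the tie-breaking rule, and the numbers are hypothetical.

```python
from collections import Counter


def q_averaging_action(q_values):
    """Average the Q-values over all ensemble members, then act greedily.

    q_values[i][a] is the Q-value that member i assigns to action a
    in the current state (assumed layout, not taken from the paper).
    """
    n_actions = len(q_values[0])
    mean_q = [sum(member[a] for member in q_values) / len(q_values)
              for a in range(n_actions)]
    return max(range(n_actions), key=lambda a: mean_q[a])


def majority_voting_action(q_values):
    """Let each member vote for its own greedy action; most votes wins.

    Ties go to the lowest action index (an assumption of this sketch).
    """
    n_actions = len(q_values[0])
    votes = Counter(max(range(n_actions), key=lambda a: member[a])
                    for member in q_values)
    best = max(votes.values())
    return min(a for a, count in votes.items() if count == best)


# Hypothetical Q-values from three networks for two actions in one state.
ensemble_q = [
    [1.0, 1.2],  # member 0 prefers action 1
    [0.9, 1.1],  # member 1 prefers action 1
    [5.0, 0.0],  # member 2 strongly prefers action 0
]

print(q_averaging_action(ensemble_q))      # one large estimate dominates the mean
print(majority_voting_action(ensemble_q))  # each member counts once
```

In this made-up example a single member with a very large Q-value pulls the averaged estimate towards its preferred action, while majority voting follows the two members that agree. This only illustrates how the two rules can disagree; it does not reproduce the paper's experiments.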

Bibliographic details

Source: International Conference on Machine Learning and Applications, 2010, 6 pages
Authors: Hans, Alexander; Udluft, Steffen
Original format: PDF
Chinese Library Classification: TP181-53
Keywords: ensemble methods; neural fitted Q-iteration; neural networks; reinforcement learning; robustness

Similar literature

1. Robust Reinforcement Learning Control Using Integral Quadratic Constraints for Recurrent Neural Networks [J]. Anderson C. W., Young P. M., Buehner M. R. IEEE Transactions on Neural Networks, 2007, No. 4.
2. Neural Network Ensembles in Reinforcement Learning [J]. Stefan Fausser, Friedhelm Schwenker. Neural Processing Letters, 2015, No. 1.
3. Neural Network Ensembles in Reinforcement Learning [J]. Stefan Faußer, Friedhelm Schwenker. Neural Processing Letters, 2015, No. 1.
4. Ensembles of Neural Networks for Robust Reinforcement Learning [C]. Hans Alexander, Udluft Steffen. Ninth International Conference on Machine Learning and Applications, 2010.
5. Adversarial Robustness and Robust Meta-Learning for Neural Networks [D]. Goldblum, Micah. 2020.
6. PNAS Plus: Unreasonable effectiveness of learning neural networks: From accessible states and robust ensembles to basic algorithmic schemes [O]. Carlo Baldassi, Christian Borgs, Jennifer T. Chayes. 2017.
7. Unreasonable effectiveness of learning neural networks: From accessible states and robust ensembles to basic algorithmic schemes [O]. Baldassi, Carlo; Borgs, Christian; Chayes, Jennifer T. 2016.
