首页> 外文会议>International Joint Conference on Neural Networks >Batch Reinforcement Learning on the Industrial Benchmark: First Experiences
【24h】

Batch Reinforcement Learning on the Industrial Benchmark: First Experiences

机译:批量加固在工业基准上学习:第一次经验

获取原文

摘要

The Particle Swarm Optimization Policy (PSO-P) has been recently introduced and proven to produce remarkable results on interacting with academic reinforcement learning benchmarks in an off-policy, batch-based setting. To further investigate the properties and feasibility on real-world applications, this paper investigates PSO-P on the so-called Industrial Benchmark (IB), a novel reinforcement learning (RL) benchmark that aims at being realistic by including a variety of aspects found in industrial applications, such as continuous state and action spaces, a high dimensional, partially observable state space, delayed effects, and complex stochasticity. The experimental results of PSO-P on IB are compared to results of closed-form control policies derived from the model-based Recurrent Control Neural Network (RCNN) and the model-free Neural Fitted Q-Iteration (NFQ). Experiments show that PSO-P is not only of interest for academic benchmarks, but also for real-world industrial applications, since it also yielded the best performing policy in our IB setting. Compared to other well established RL techniques, PSO-P produced outstanding results in performance and robustness, requiring only a relatively low amount of effort in finding adequate parameters or making complex design decisions.
机译:最近介绍并证明了粒子群优化政策(PSO-P)以在禁区批量批量的环境中与学术强化学习基准进行互动的显着成果。为了进一步研究现实世界应用的性质和可行性,本文调查了所谓的工业基准(IB)的PSO-P,这是一种新颖的加强学习(RL)基准,其旨在通过包括各种各样的方面来实现的现实在工业应用中,例如连续状态和动作空间,高维,部分可观察的状态空间,延迟效应和复杂的随机性。将PSO-P对IB的实验结果与从基于模型的复发控制神经网络(RCNN)和无模型神经拟合Q迭代(NFQ)的闭合形式控制政策的结果进行比较。实验表明,PSO-P不仅对学术基准的兴趣,而且对于真实世界的工业应用,因此它还在我们的IB设置中产生了最佳表演政策。与其他完善的RL技术相比,PSO-P在性能和稳健性中产生了出色的结果,只需要相对较低的努力来寻找足够的参数或制作复杂的设计决策。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号