首页> 外文学位 >Coping with the curse of dimensionality by combining linear programming and reinforcement learning.
【24h】

Coping with the curse of dimensionality by combining linear programming and reinforcement learning.

机译:通过将线性规划和强化学习相结合,应对维度的诅咒。

获取原文
获取原文并翻译 | 示例

摘要

Reinforcement learning techniques offer a very powerful method of finding solutions in unpredictable problem environments where human supervision is not possible. However, in many real world situations, the state space needed to represent the solutions becomes so large that using these methods becomes infeasible. Often the vast majority of these states are not valuable in finding the optimal solution.;This work introduces a novel method of using linear programming to identify and represent the small area of the state space that is most likely to lead to a near-optimal solution, significantly reducing the memory requirements and time needed to arrive at a solution.;An empirical study is provided to show the validity of this method with respect to a specific problem in vehicle dispatching. This study demonstrates that, in problems that are too large for a traditional reinforcement learning agent, this new approach yields solutions that are a significant improvement over other nonlearning methods. In addition, this new method is shown to be robust to changing conditions both during training and execution.;Finally, some areas of future work are outlined to introduce how this new approach might be applied to additional problems and environments.
机译:强化学习技术提供了一种非常强大的方法,可以在无法人工监督的无法预测的问题环境中找到解决方案。但是,在许多现实世界中,表示解决方案所需的状态空间变得如此之大,以至于无法使用这些方法。通常,这些状态中的绝大多数对寻找最优解没有价值。这项工作介绍了一种使用线性规划来识别和表示状态空间中很小的区域的新方法,这很可能导致接近最优的解决方案;通过实证研究显示该方法对于车辆调度中的特定问题的有效性。这项研究表明,对于传统的强化学习代理来说,问题太大了,这种新方法所产生的解决方案比其他非学习方法有了显着改进。此外,该新方法显示出在训练和执行过程中对条件变化的鲁棒性。最后,概述了未来工作的某些领域,以介绍如何将该新方法应用于其他问题和环境。

著录项

  • 作者

    Burton, Scott H.;

  • 作者单位

    Utah State University.;

  • 授予单位 Utah State University.;
  • 学科 Operations Research.;Artificial Intelligence.;Computer Science.
  • 学位 M.S.
  • 年度 2010
  • 页码 66 p.
  • 总页数 66
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号