首页> 外文会议>Ninth International Conference on Machine Learning and Applications >Robust Learning for Adaptive Programs by Leveraging Program Structure
【24h】

Robust Learning for Adaptive Programs by Leveraging Program Structure

机译:通过利用程序结构对自适应程序进行鲁棒的学习

获取原文

摘要

We study how to effectively integrate reinforcement learning (RL) and programming languages via adaptation-based programming, where programs can include non-deterministic structures that can be automatically optimized via RL. Prior work has optimized adaptive programs by defining an induced sequential decision process to which standard RL is applied. Here we show that the success of this approach is highly sensitive to the specific program structure, where even seemingly minor program transformations can lead to failure. This sensitivity makes it extremely difficult for a non-RL-expert to write effective adaptive programs. In this paper, we study a more robust learning approach, where the key idea is to leverage information about program structure in order to define a more informative decision process and to improve the SARSA(lambda) RL algorithm. Our empirical results show significant benefits for this approach.
机译:我们研究如何通过基于适应的编程有效地整合强化学习(RL)和编程语言,其中程序可以包含可以通过RL自动优化的非确定性结构。先前的工作通过定义将标准RL应用于其的诱导顺序决策过程来优化自适应程序。在这里,我们证明了这种方法的成功对特定的程序结构高度敏感,在特定的程序结构中,即使看似很小的程序转换也可能导致失败。这种敏感性使非RL专家极难编写有效的自适应程序。在本文中,我们研究了一种更强大的学习方法,其中的关键思想是利用有关程序结构的信息来定义更具参考性的决策过程并改进SARSA(lambda)RL算法。我们的经验结果表明,这种方法具有明显的优势。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号