Robust Learning for Adaptive Programs by Leveraging Program Structure

机译：通过利用程序结构对自适应程序进行鲁棒的学习

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We study how to effectively integrate reinforcement learning (RL) and programming languages via adaptation-based programming, where programs can include non-deterministic structures that can be automatically optimized via RL. Prior work has optimized adaptive programs by defining an induced sequential decision process to which standard RL is applied. Here we show that the success of this approach is highly sensitive to the specific program structure, where even seemingly minor program transformations can lead to failure. This sensitivity makes it extremely difficult for a non-RL-expert to write effective adaptive programs. In this paper, we study a more robust learning approach, where the key idea is to leverage information about program structure in order to define a more informative decision process and to improve the SARSA(lambda) RL algorithm. Our empirical results show significant benefits for this approach.

机译：我们研究如何通过基于适应的编程有效地整合强化学习（RL）和编程语言，其中程序可以包含可以通过RL自动优化的非确定性结构。先前的工作通过定义将标准RL应用于其的诱导顺序决策过程来优化自适应程序。在这里，我们证明了这种方法的成功对特定的程序结构高度敏感，在特定的程序结构中，即使看似很小的程序转换也可能导致失败。这种敏感性使非RL专家极难编写有效的自适应程序。在本文中，我们研究了一种更强大的学习方法，其中的关键思想是利用有关程序结构的信息来定义更具参考性的决策过程并改进SARSA（lambda）RL算法。我们的经验结果表明，这种方法具有明显的优势。

著录项

来源
《Ninth International Conference on Machine Learning and Applications》|2010年|p.943-948|共6页
会议地点
作者
Pinto Jervis; Fern Alan; Bauer Tim; Erwig Martin;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类人工智能理论;
关键词
adaptation-based programming; partial programming; reinforcement learning;

机译：基于适应的程序设计;部分程序设计;强化学习;
入库时间 2022-08-26 15:01:32

相似文献

外文文献
中文文献
专利

1. Robust Adaptive Control with Active Learning for Fed-Batch Process based on Approximate Dynamic Programming [J] . Ha-Eun Byun, Boeun Kim, Jay H. Lee IFAC PapersOnLine . 2020,第2期

机译：基于近似动态编程的FED批处理过程具有强大的自适应控制
2. Decentralized robust optimal control for modular robot manipulators via critic-identifier structure-based adaptive dynamic programming [J] . Neural computing & applications . 2020,第8期

机译：通过批评标识符结构的自适应动态编程模块化机器人操纵器的分散鲁棒优化控制
3. Challenges for leveraging citizen science to support statistically robust monitoring programs [J] . Weiser Emily L., Diffendorfer Jay E., Lopez-Hoffman Laura, Biological Conservation . 2020,第期

机译：利用公民科学支持统计上强大的监测计划的挑战
4. Robust Learning for Adaptive Programs by Leveraging Program Structure [C] . Pinto Jervis, Fern Alan, Bauer Tim, International Conference on Machine Learning and Applications . 2010

机译：通过利用程序结构来实现自适应程序的强大学习
5. Exploring Social Interventions for Computer Programming: Leveraging Learning Theories to Affect Student Social and Programming Behavior [D] . Olivares, Daniel Michael. 2019

机译：探索计算机规划的社会干预：利用学习理论，影响学生社会和编程行为
6. Adaptively Weighted and Robust Mathematical Programming for the Discovery of Driver Gene Sets in Cancers [O] . Xiaolu Xu, Pan Qin, Hong Gu, -1

机译：自适应加权和鲁棒的数学编程用于发现癌症驱动基因
7. Robust Learning for Adaptive Programs by Leveraging Program Structure [O] . Jervis Pinto, Alan Fern, Tim Bauer, 2011

机译：通过利用程序结构对自适应程序进行鲁棒的学习

Robust Learning for Adaptive Programs by Leveraging Program Structure

摘要

著录项

相似文献

相关主题

期刊订阅