Rover Descent: Learning to Optimize by Learning to Navigate on Prototypical Loss Surfaces


Abstract

Learning to optimize, the idea that algorithms for optimizing a numerical criterion can be learned from data, has recently been at the heart of a growing number of research efforts. One of the most challenging issues in this approach is learning a policy that can optimize over classes of functions different from those it was trained on. We propose a novel way of framing learning to optimize as the problem of learning a good navigation policy on a partially observable loss surface. To this end, we develop Rover Descent, a solution that learns a broad optimization policy by training only on a small set of prototypical two-dimensional surfaces, covering classically hard cases such as valleys, plateaus, cliffs and saddles, and by using strictly zeroth-order information. We show that, without access to gradient or curvature information, we achieve fast convergence on optimization problems not presented at training time, such as the Rosenbrock function and other hard two-dimensional functions. We extend our framework to optimize over high-dimensional functions and show good preliminary results.
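To make "strictly zeroth-order information" concrete, the following is a minimal sketch of an optimizer that only ever queries function values on the Rosenbrock function named in the abstract. It is not Rover Descent and not a learned policy: it is a classical hand-written compass (pattern) search, swapped in purely for illustration. The names `rosenbrock` and `zeroth_order_search`, the start point, and the step parameters are all assumptions and do not come from the paper.

```python
def rosenbrock(x, y, a=1.0, b=100.0):
    """Rosenbrock function; its global minimum is 0 at (a, a**2)."""
    return (a - x) ** 2 + b * (y - x ** 2) ** 2


def zeroth_order_search(f, start, step=0.5, shrink=0.5,
                        min_step=1e-8, max_evals=20000):
    """Compass search (illustrative baseline, not the paper's method).

    Only function values are used: no gradient, no curvature.
    """
    x, y = start
    best = f(x, y)
    evals = 1
    while step > min_step and evals < max_evals:
        improved = False
        # Probe the four axis-aligned neighbours at the current step size.
        for dx, dy in ((step, 0.0), (-step, 0.0), (0.0, step), (0.0, -step)):
            val = f(x + dx, y + dy)
            evals += 1
            if val < best:
                # Accept the first improving move.
                x, y, best = x + dx, y + dy, val
                improved = True
                break
        if not improved:
            # No neighbour is better: refine the search scale.
            step *= shrink
    return (x, y), best, evals


if __name__ == "__main__":
    point, value, n_evals = zeroth_order_search(rosenbrock, (-1.5, 2.0))
    print(point, value, n_evals)
```

A fixed rule like this accepts only moves that lower the function value, so it never gets worse, but it tends to crawl along the curved Rosenbrock valley. That is the kind of hard case the abstract lists as motivation for learning a navigation policy instead.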


Bibliographic record

Source: International Conference on Learning and Intelligent Optimization | 2019 | 474p | 17 pages total
Authors: Louis Faury; Flavian Vasile
Original format: PDF
Chinese Library Classification (CLC): TP18-53
Date added: 2022-08-20 20:17:38

Similar documents


1. Navigating with peripheral field loss in a museum: learning impairments due to environmental complexity [J]. Erica M. Barhorst-Cates, Kristina M. Rand, Sarah H. Creem-Regehr. Cognitive Research: Principles and Implications, 2019, No. 1.
2. Lower error bounds for the stochastic gradient descent optimization algorithm: Sharp convergence rates for slowly and fast decaying learning rates [J]. Journal of Complexity, 2020, No. Apra.
3. Asymptotic Network Independence in Distributed Stochastic Optimization for Machine Learning: Examining Distributed and Centralized Stochastic Gradient Descent [J]. Pu Shi, Olshevsky Alex, Paschalidis Ioannis Ch. IEEE Signal Processing Magazine, 2020, No. 3.
4. Rover Descent: Learning to Optimize by Learning to Navigate on Prototypical Loss Surfaces [C]. Louis Faury, Flavian Vasile. International Conference on Learning and Intelligent Optimization, 2019.
5. Self learning strategies for experimental design and response surface optimization [D]. Alaeddini, Adel. 2011.
6. Navigating with peripheral field loss in a museum: learning impairments due to environmental complexity [O]. Erica M. Barhorst-Cates, Kristina M. Rand, Sarah H. Creem-Regehr. 2019.
