International Conference on Learning and Intelligent Optimization

Rover Descent: Learning to Optimize by Learning to Navigate on Prototypical Loss Surfaces



Abstract

Learning to optimize - the idea that we can learn from data algorithms that optimize a numerical criterion - has recently been at the heart of a growing number of research efforts. One of the most challenging issues within this approach is to learn a policy that is able to optimize over classes of functions that are different from the classes that the policy was trained on. We propose a novel way of framing learning to optimize as a problem of learning a good navigation policy on a partially observable loss surface. To this end, we develop Rover Descent, a solution that allows us to learn a broad optimization policy from training only on a small set of prototypical two-dimensional surfaces that encompasses classically hard cases such as valleys, plateaus, cliffs and saddles and by using strictly zeroth-order information. We show that, without having access to gradient or curvature information, we achieve fast convergence on optimization problems not presented at training time, such as the Rosenbrock function and other two-dimensional hard functions. We extend our framework to optimize over high dimensional functions and show good preliminary results.
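To make the setting concrete, the sketch below shows what a strictly zeroth-order optimizer looks like on the Rosenbrock function mentioned in the abstract. This is not the paper's learned navigation policy, only a minimal baseline: a (1+1) random search that queries function values alone, with a 1/5th-success-rule-style step-size adaptation. The step sizes, iteration budget, and starting point are illustrative assumptions.

```python
import random

def rosenbrock(x, y, a=1.0, b=100.0):
    # Classic non-convex benchmark with a narrow curved valley;
    # global minimum at (a, a^2) with value 0.
    return (a - x) ** 2 + b * (y - x * x) ** 2

def zeroth_order_search(f, x, y, steps=5000, sigma=0.5, seed=0):
    # (1+1) random search: propose a Gaussian perturbation and keep it
    # only if the loss improves. Uses function values only -- no gradient
    # or curvature information, matching the zeroth-order setting.
    rng = random.Random(seed)
    best = f(x, y)
    for _ in range(steps):
        nx, ny = x + rng.gauss(0, sigma), y + rng.gauss(0, sigma)
        v = f(nx, ny)
        if v < best:
            x, y, best = nx, ny, v
            sigma *= 1.1   # expand the step size on success
        else:
            sigma *= 0.98  # shrink it on failure (1/5th-rule flavour)
    return x, y, best

x, y, v = zeroth_order_search(rosenbrock, -1.5, 1.5)
print(x, y, v)
```

A fixed-step random search tends to stall in Rosenbrock's valley; the success-based step-size adaptation is what lets this baseline keep making progress, and it is exactly the kind of hand-tuned heuristic that a learned navigation policy aims to replace.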


