首页> 外文会议>International Conference on Learning and Intelligent Optimization >Rover Descent: Learning to Optimize by Learning to Navigate on Prototypical Loss Surfaces
【24h】

Rover Descent: Learning to Optimize by Learning to Navigate on Prototypical Loss Surfaces

机译:流浪者血液:学习通过学习优化来导航原型损失表面

获取原文

摘要

Learning to optimize - the idea that we can learn from data algorithms that optimize a numerical criterion - has recently been at the heart of a growing number of research efforts. One of the most challenging issues within this approach is to learn a policy that is able to optimize over classes of functions that are different from the classes that the policy was trained on. We propose a novel way of framing learning to optimize as a problem of learning a good navigation policy on a partially observable loss surface. To this end, we develop Rover Descent, a solution that allows us to learn a broad optimization policy from training only on a small set of prototypical two-dimensional surfaces that encompasses classically hard cases such as valleys, plateaus, cliffs and saddles and by using strictly zeroth-order information. We show that, without having access to gradient or curvature information, we achieve fast convergence on optimization problems not presented at training time, such as the Rosenbrock function and other two-dimensional hard functions. We extend our framework to optimize over high dimensional functions and show good preliminary results.
机译:学习优化 - 我们可以从数据算法中学习的想法,以优化数值标准 - 最近一直处于越来越多的研究工作中的核心。这种方法中最具挑战性的问题之一是学习能够优化与策略培训的课程不同的函数的策略。我们提出了一种新颖的帧学习方式,以优化作为在部分可观察的损失表面上学习良好导航政策的问题。为此,我们开发流浪者血液,一个解决方案,允许我们从训练中汲取广泛的优化政策,这些解决方案仅仅是一小套原型的二维表面,包括山谷,强力,悬崖和鞍座等经典硬壳,以及通过使用严格的零点信息。我们表明,在不访问梯度或曲率信息的情况下,我们在培训时间未呈现的优化问题上实现快速收敛,例如Rosenbrock函数和其他二维硬功能。我们扩展了我们的框架,以优化高维功能,并显示出良好的初步结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号