JMLR: Workshop and Conference Proceedings

Learning to Learn without Gradient Descent by Gradient Descent

Abstract

We learn recurrent neural network optimizers trained on simple synthetic functions by gradient descent. We show that these learned optimizers exhibit a remarkable degree of transfer in that they can be used to efficiently optimize a broad range of derivative-free black-box functions, including Gaussian process bandits, simple control objectives, global optimization benchmarks and hyper-parameter tuning tasks. Up to the training horizon, the learned optimizers learn to trade off exploration and exploitation, and compare favourably with heavily engineered Bayesian optimization packages for hyper-parameter tuning.
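Below is a minimal sketch, assuming PyTorch, of the idea summarized in the abstract. It is not the paper's exact setup: the names (RNNOptimizer, rollout), the single-LSTM-cell architecture, and the random quadratic training objectives are illustrative assumptions (the paper trains on richer synthetic functions such as Gaussian process samples). The recurrent optimizer proposes the next query point from the previous query and its observed value, and its own parameters are trained by gradient descent through differentiable training functions.

    # Minimal sketch (assumed names and architecture, not the paper's exact method).
    import torch
    import torch.nn as nn

    class RNNOptimizer(nn.Module):
        """LSTM cell that maps the last query and its observed value to the next query."""
        def __init__(self, dim, hidden=64):
            super().__init__()
            self.cell = nn.LSTMCell(dim + 1, hidden)
            self.head = nn.Linear(hidden, dim)

        def forward(self, x_prev, y_prev, state):
            h, c = self.cell(torch.cat([x_prev, y_prev], dim=-1), state)
            return self.head(h), (h, c)

    def rollout(opt_net, f, dim, horizon):
        """Unroll the learned optimizer on f and return the sum of observed values."""
        x = torch.zeros(1, dim)
        y = f(x)
        state = None
        total = y.sum()
        for _ in range(horizon - 1):
            x, state = opt_net(x, y, state)
            y = f(x)                      # differentiable during meta-training
            total = total + y.sum()
        return total

    # Meta-training: minimise the summed function values over synthetic objectives
    # (here random quadratics stand in for the paper's synthetic training functions).
    dim, horizon = 2, 20
    opt_net = RNNOptimizer(dim)
    meta_opt = torch.optim.Adam(opt_net.parameters(), lr=1e-3)
    for step in range(1000):
        A = torch.randn(dim, dim)
        f = lambda x: ((x @ A) ** 2).sum(dim=-1, keepdim=True)  # synthetic objective
        loss = rollout(opt_net, f, dim, horizon)
        meta_opt.zero_grad()
        loss.backward()                   # gradient descent on the optimizer's parameters
        meta_opt.step()

At test time only forward passes of the trained optimizer are needed, so the objective being optimized can be a black-box function with no available gradients.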
