JMLR: Workshop and Conference Proceedings

On the Dynamics of Gradient Descent for Autoencoders


Abstract

We provide a series of results for unsupervised learning with autoencoders. Specifically, we study shallow two-layer autoencoder architectures with shared weights. We focus on three generative models for data that are common in statistical machine learning: (i) the mixture-of-Gaussians model, (ii) the sparse coding model, and (iii) the sparsity model with non-negative coefficients. For each of these models, we prove that under suitable choices of hyperparameters, architectures, and initialization, autoencoders learned by gradient descent can successfully recover the parameters of the corresponding model. To our knowledge, this is the first result that rigorously studies the dynamics of gradient descent for weight-sharing autoencoders. Our analysis can be viewed as theoretical evidence that shallow autoencoder modules indeed can be used as feature learning mechanisms for a variety of data models, and may shed insight on how to train larger stacked architectures with autoencoders as basic building blocks.
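To make the setting concrete, below is a minimal sketch of the kind of architecture the abstract describes: a weight-tied two-layer autoencoder trained by plain gradient descent on data drawn from a sparse coding model with non-negative coefficients. This is illustrative only; the dimensions, ReLU activation, bias value, learning rate, and initialization here are assumptions for the sketch, not the specific conditions the paper's theorems require.

```python
# A sketch, assuming a sparse coding model x = A z with sparse non-negative z
# and a ground-truth dictionary A. The encoder is h = ReLU(W x + b); the
# decoder reuses the same weights transposed, xhat = W^T h (weight sharing).
import numpy as np

rng = np.random.default_rng(0)
n, k, m = 64, 128, 2048   # data dim, hidden dim, number of samples (assumed)
sparsity = 3              # non-zeros per code vector (assumed)

# Generative model: dictionary with unit-norm columns, sparse non-negative codes.
A = rng.normal(size=(n, k))
A /= np.linalg.norm(A, axis=0, keepdims=True)
Z = np.zeros((k, m))
for j in range(m):
    idx = rng.choice(k, size=sparsity, replace=False)
    Z[idx, j] = rng.uniform(0.5, 1.5, size=sparsity)
X = A @ Z                 # one sample per column

# Weight-tied autoencoder parameters (illustrative initialization).
W = rng.normal(size=(k, n)) / np.sqrt(n)
b = np.full(k, -0.1)      # small negative bias acts as a soft threshold
lr = 0.05

for step in range(500):
    Pre = W @ X + b[:, None]
    H = np.maximum(Pre, 0.0)          # ReLU encoder activations
    Xhat = W.T @ H                    # decoding with the shared (transposed) weights
    E = Xhat - X                      # reconstruction error
    # Backpropagate through both uses of W: the encoder and the tied decoder.
    dPre = (W @ E) * (Pre > 0)
    gradW = (dPre @ X.T + H @ E.T) / m
    gradb = dPre.mean(axis=1)
    W -= lr * gradW
    b -= lr * gradb
    if step % 100 == 0:
        loss = 0.5 * np.mean(np.sum(E**2, axis=0))
        print(f"step {step:4d}  reconstruction loss {loss:.4f}")
```

In this setting, parameter recovery would mean that, after training, the rows of W align (up to permutation and scaling) with the columns of the ground-truth dictionary A; one could check this by comparing normalized inner products between the two.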
