The permutation symmetry of the hidden units in multilayer perceptrons causes the saddle structure and plateaus that characterize the learning dynamics of gradient methods. The correlation of the weight vectors in the teacher network is believed to affect this saddle structure and thereby prolong the learning time, but the mechanism has remained unclear. In this paper, we analyze it for soft committee machines under on-line learning, using methods from statistical mechanics. Conventional steepest gradient descent requires a learning time that grows with the correlation of the teacher weight vectors. Natural gradient descent, in contrast, exhibits no plateaus in the limit of small learning rate, even when the weight vectors are strongly correlated, which worsens the singularity of the Fisher information matrix. Analytical results around the saddle point support these dynamics.
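As a concrete illustration of the setting (not the paper's own code), the sketch below implements on-line steepest gradient descent for a soft committee machine student learning from a teacher whose weight vectors are made correlated through a shared component. The activation g(x) = erf(x/√2), the dimensions N and K, the learning rate eta, and the overlap parameter rho are all illustrative assumptions; near the permutation-symmetric saddle the student vectors stay nearly identical for many steps, which is the plateau discussed above.

```python
import numpy as np
from scipy.special import erf

def g(x):
    # Sigmoidal activation commonly used in this literature (assumption)
    return erf(x / np.sqrt(2.0))

def g_prime(x):
    return np.sqrt(2.0 / np.pi) * np.exp(-x ** 2 / 2.0)

rng = np.random.default_rng(0)
N, K = 500, 2        # input dimension, number of hidden units (assumed values)
eta = 0.1            # learning rate (assumed value)
rho = 0.8            # strength of the shared teacher component (assumed value)
steps = 20000

# Correlated teacher: each row mixes a common direction with an independent one,
# so distinct teacher vectors have a nonzero mutual overlap controlled by rho.
common = rng.standard_normal(N)
B = rho * common + np.sqrt(1.0 - rho ** 2) * rng.standard_normal((K, N))
B /= np.linalg.norm(B, axis=1, keepdims=True)  # unit-length teacher vectors

# Student weights; in a soft committee machine the hidden-to-output
# weights are fixed to one, so only J is learned.
J = rng.standard_normal((K, N)) / np.sqrt(N)

for m in range(steps):
    xi = rng.standard_normal(N)   # fresh random example each step (on-line learning)
    t = g(B @ xi).sum()           # teacher output
    x = J @ xi                    # student local fields
    s = g(x).sum()                # student output
    # Plain steepest gradient descent on the squared error, scaled by 1/N
    J += (eta / N) * (t - s) * np.outer(g_prime(x), xi)
```

Tracking the student-student overlaps J_i · J_k over time would show the long plateau under this plain update; the natural gradient variant would precondition the update with the inverse Fisher information matrix, which is exactly what becomes singular when the correlation is strong.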